martintomov
commited on
gallery
Browse files
README.md
CHANGED
@@ -8,6 +8,10 @@ tags:
|
|
8 |
- art style
|
9 |
- mochi
|
10 |
- diffusion
|
|
|
|
|
|
|
|
|
11 |
---
|
12 |
|
13 |
# Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi
|
@@ -20,12 +24,8 @@ This project demonstrates the fine-tuning of the **Mochi Text-to-Video** model u
|
|
20 |
- **Fine-Tuning Dataset**: 23 short video clips of infinite zoom art style
|
21 |
- **Training Settings :**: 37 frames
|
22 |
- **Training Hardware**: H100 GPU
|
23 |
-
- **Training Duration**:
|
24 |
|
25 |
---
|
26 |
|
27 |
-
|
28 |
-
|
29 |
-
Below is an example of the generated video output:
|
30 |
-
|
31 |
-
### Samples
|
|
|
8 |
- art style
|
9 |
- mochi
|
10 |
- diffusion
|
11 |
+
widget:
|
12 |
+
- text: Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, illuminated by neon lights and bustling with activity. The zoom focuses on a lit billboard advertising a soda can, transitioning into the sparkling surface of the liquid. As the zoom deepens, microscopic bubbles transform into entire ecosystems of floating islands within the soda.
|
13 |
+
output:
|
14 |
+
url: 0.mp4
|
15 |
---
|
16 |
|
17 |
# Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi
|
|
|
24 |
- **Fine-Tuning Dataset**: 23 short video clips of infinite zoom art style
|
25 |
- **Training Settings :**: 37 frames
|
26 |
- **Training Hardware**: H100 GPU
|
27 |
+
- **Training Duration**: 2h
|
28 |
|
29 |
---
|
30 |
|
31 |
+
<Gallery />
|
|
|
|
|
|
|
|