metadata
license: apache-2.0
base_model:
- genmo/mochi-1-preview
pipeline_tag: text-to-video
tags:
- infinite zoom
- art style
- mochi
- diffusion
widget:
- text: >-
Human fingers pinching to zoom on an infinite zoom canvas, a detailed
cityscape at night, illuminated by neon lights and bustling with activity.
The zoom focuses on a lit billboard advertising a soda can, transitioning
into the sparkling surface of the liquid. As the zoom deepens, microscopic
bubbles transform into entire ecosystems of floating islands within the
soda.
output:
url: 0.mp4
Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi
This project demonstrates the fine-tuning of the Mochi Text-to-Video model using a LoRA (Low-Rank Adaptation) approach, focusing on the infinite zoom art style.
Training Details
- Model Base: genmo/mochi-1-preview
- Fine-Tuning Dataset: 23 short video clips of infinite zoom art style, and .txt descriptions
- Training Settings :: 37 frames
- Training Hardware: H100 GPU
- Training Duration: 2h