metadata

license: apache-2.0
base_model:
  - genmo/mochi-1-preview
pipeline_tag: text-to-video
tags:
  - infinite zoom
  - art style
  - mochi
  - diffusion
widget:
  - text: >-
      Human fingers pinching to zoom on an infinite zoom canvas, a detailed
      cityscape at night, illuminated by neon lights and bustling with activity.
      The zoom focuses on a lit billboard advertising a soda can, transitioning
      into the sparkling surface of the liquid. As the zoom deepens, microscopic
      bubbles transform into entire ecosystems of floating islands within the
      soda.
    output:
      url: 0.mp4

Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi

This project demonstrates fine-tuning the Mochi text-to-video model with LoRA (Low-Rank Adaptation) to reproduce the infinite zoom art style.
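As a rough illustration of why a LoRA approach keeps fine-tuning cheap, the sketch below compares trainable parameter counts for a single linear layer. The layer size and rank are hypothetical values chosen for illustration, not the settings used for this model.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Parameters updated when fine-tuning the whole weight matrix W (d_out x d_in)."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in the low-rank update B @ A, with A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


# Hypothetical layer size and rank, for illustration only.
d_in, d_out, rank = 3072, 3072, 16
full = full_params(d_in, d_out)
lora = lora_params(d_in, d_out, rank)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")
```

The base weights stay frozen and only the two small matrices are trained, which is what makes a short run on a small dataset plausible.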

Training Details

Model Base: genmo/mochi-1-preview
Fine-Tuning Dataset: 23 short video clips in the infinite zoom art style, with accompanying .txt descriptions
Training Settings: 37 frames
Training Hardware: H100 GPU
Training Duration: 2 hours
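A dataset of clips plus .txt descriptions can be sanity-checked before training. The sketch below is a minimal example that assumes each description shares its clip's base name (for example `clip01.mp4` and `clip01.txt`) and that clips are `.mp4` files; the layout, the extension, and the function name are assumptions, not details given above.

```python
from pathlib import Path


def find_caption_pairs(dataset_dir: str, video_ext: str = ".mp4"):
    """Return (pairs, missing): clips with a same-named .txt caption, and clips without one."""
    pairs, missing = [], []
    for video in sorted(Path(dataset_dir).glob(f"*{video_ext}")):
        # Assumed convention: the caption sits next to the clip with a .txt suffix.
        caption = video.with_suffix(".txt")
        if caption.is_file():
            pairs.append((video, caption.read_text(encoding="utf-8").strip()))
        else:
            missing.append(video)
    return pairs, missing
```

Calling `find_caption_pairs("videos")` on a hypothetical `videos` folder returns the captioned clips and any clips still missing a description, so gaps can be fixed before a training run starts.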

Prompt: Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, zoom focuses on a can, all surface around it is made of liquid and objects swimming in it.

Prompt: Human fingers pinching to zoom on an infinite zoom canvas, spaceship going through space.

Prompt: Human fingers pinching to zoom on an infinite zoom canvas, orange cat in the middle of a canvas, looking upward.
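Every example prompt in this card opens with the same phrase, which suggests it acts as a trigger for the fine-tuned style. That is an inference from the prompts, not something the training details state. A small helper, with a hypothetical name, could prepend the phrase consistently:

```python
# Shared opening phrase copied from the example prompts above.
TRIGGER = "Human fingers pinching to zoom on an infinite zoom canvas"


def build_prompt(scene: str) -> str:
    """Prepend the shared opening phrase to a scene description."""
    # Normalize whitespace and a trailing period so the result ends with exactly one.
    scene = scene.strip().rstrip(".")
    return f"{TRIGGER}, {scene}."


print(build_prompt("spaceship going through space"))
```

Keeping the opening phrase in one constant avoids small wording drift between prompts, which may matter if the style is tied to that exact wording.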