InfiniteZoom-Mochi / README.md
metadata
license: apache-2.0
base_model:
  - genmo/mochi-1-preview
pipeline_tag: text-to-video
tags:
  - infinite zoom
  - art style
  - mochi
  - diffusion
widget:
  - text: >-
      Human fingers pinching to zoom on an infinite zoom canvas, a detailed
      cityscape at night, illuminated by neon lights and bustling with activity.
      The zoom focuses on a lit billboard advertising a soda can, transitioning
      into the sparkling surface of the liquid. As the zoom deepens, microscopic
      bubbles transform into entire ecosystems of floating islands within the
      soda.
    output:
      url: 0.mp4

Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi

This project fine-tunes the Mochi text-to-video model with LoRA (Low-Rank Adaptation) to reproduce the infinite zoom art style.
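As a rough illustration of why LoRA is attractive for a fine-tune like this one, the sketch below compares the number of trainable parameters for a full update of a single linear layer against a low-rank update of the same layer. The layer sizes and rank are illustrative assumptions, not values taken from the Mochi architecture or from this project's training configuration.

```python
def lora_param_counts(d_in: int, d_out: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable-parameter counts for one linear layer.

    A full update trains the whole d_out x d_in weight matrix.
    LoRA freezes that matrix and trains two small factors instead:
    A with shape (rank, d_in) and B with shape (d_out, rank).
    """
    if rank <= 0 or rank > min(d_in, d_out):
        raise ValueError("rank must be between 1 and min(d_in, d_out)")
    full = d_in * d_out
    lora = rank * (d_in + d_out)
    return full, lora


# Hypothetical sizes chosen only to show the scale of the saving.
full, lora = lora_param_counts(d_in=1024, d_out=1024, rank=16)
print(f"full update: {full} parameters, LoRA update: {lora} parameters")
```

The smaller the rank relative to the layer width, the fewer parameters need gradients and optimizer state, which is what makes short single-GPU runs plausible.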

Training Details

  • Model Base: genmo/mochi-1-preview
  • Fine-Tuning Dataset: 23 short video clips in the infinite zoom art style, with accompanying .txt descriptions
  • Training Settings: 37 frames
  • Training Hardware: H100 GPU
  • Training Duration: 2h
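
A dataset of clips plus .txt descriptions, like the one listed above, can be sanity-checked before training with a small script. This is a minimal sketch that assumes each clip and its description share a file stem (for example `clip01.mp4` and `clip01.txt`) and sit in one folder; that layout, the `.mp4` extension, and the function name are assumptions for illustration, not details confirmed by this project.

```python
from pathlib import Path


def pair_clips_with_captions(dataset_dir, video_ext=".mp4"):
    """Match each video clip with a same-named .txt description.

    Returns (pairs, missing): pairs is a list of (video name, caption text),
    and missing lists the videos that have no description file.
    """
    root = Path(dataset_dir)
    pairs, missing = [], []
    for video in sorted(root.glob(f"*{video_ext}")):
        caption_file = video.with_suffix(".txt")
        if caption_file.is_file():
            caption = caption_file.read_text(encoding="utf-8").strip()
            pairs.append((video.name, caption))
        else:
            missing.append(video.name)
    return pairs, missing
```

Calling `pair_clips_with_captions("videos/")` on a hypothetical `videos/` folder would return the matched pairs along with any clips that still need a description, so gaps can be fixed before spending GPU time.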

Prompt
Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, zoom focuses on a can, all surface around it is made of liquid and objects swimming in it.
Prompt
Human fingers pinching to zoom on an infinite zoom canvas, spaceship going through space.
Prompt
Human fingers pinching to zoom on an infinite zoom canvas, orange cat in the middle of a canvas, looking upward.
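
The example prompts above all open with the same phrase and then describe a scene. A small helper can keep new prompts in that shape. This is only a sketch: the helper name is hypothetical, and whether the shared opening phrase is required for the LoRA to take effect is an assumption based on the examples, not something this README states.

```python
# Opening phrase shared by the example prompts in this README.
ZOOM_PREFIX = "Human fingers pinching to zoom on an infinite zoom canvas"


def build_zoom_prompt(scene: str) -> str:
    """Prepend the shared opening phrase to a scene description."""
    scene = scene.strip().rstrip(".")
    if not scene:
        raise ValueError("scene description must not be empty")
    return f"{ZOOM_PREFIX}, {scene}."


print(build_zoom_prompt("spaceship going through space"))
```

The returned string can then be passed as the text prompt to whichever text-to-video pipeline is used to run the fine-tuned model.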