--- language: - en library_name: stable-audio-tools license: other license_name: stable-audio-community pipeline_tag: text-to-audio tags: - text-to-audio inference: true widget: - src: ./assets/demo_cfg_3_00000001.wav example_title: 'Unconditional (blank prompt)' parameters: negative_prompt: 'blurry, cropped, ugly' - text: 'Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/music_3_illustration.jpg - text: 'Chill hip-hop beat, chillhop, lofi pop, favorite music' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/music_4_illustration.jpg --- You can use this model in [stable-audio-tools](https://github.com/Stability-AI/stable-audio-tools), fine-tuned on my favorite song from my [personal playlist](https://www.youtube.com/watch?v=dQw4w9WgXcQ).
Comparison Table
Prompt Base Model Fine-Tuned
Feel-Good Vibes and Dramatic Atmosphere, alone hero, epic, get good yeah, better last night pop, follow follow, echoing, powerful vocal driving melancholic vocals dramatic Features rising tension, progressive electro house, far away, by Alan Walker, popular song tempo, girl, female synth, popular, titled: legend never die
Beautiful music progressive electro slap mood, upbeat, heavy bass, melancholic, hopeful; drums, vocals, dynamic shifts, building intensity, run far away, repetitive, let let go, think of us, titled popular lyrics: Mirror's Edge, popular lyrics say: "still still alive"
Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye
Chill hip-hop beat, chillhop, lofi pop, favorite music
Showcase Model Details

Test Settings:

  • CFG: 7.0
  • Steps: 100
  • Seed: -1

Prompt have been chosen based on the top tagged words except last prompt which is used to compare effect on non-trained tags

Training ### Dataset: 2-3 min music length - All of my Liked music [download and auto label](https://pastebin.com/z1bkZyqe) so mostly copyright. - Total number of samples: ~1383 - `"random_crop": true` in [dataset_config.json](https://github.com/Stability-AI/stable-audio-tools/issues/99#issuecomment-2174885688) ### Settings: - Training epochs: 1 - Training steps: 1383 - Learning rate: 1e-05