Nekochu committed on
Commit
d61e375
1 Parent(s): e1d6911

Update README.md

Files changed (1)
  1. README.md +151 -143
README.md CHANGED
@@ -1,143 +1,151 @@
1
- ---
2
- language:
3
- - en
4
- library_name: stable-audio-tools
5
- license: other
6
- license_name: stable-audio-community
7
- pipeline_tag: text-to-audio
8
- tags:
9
- - text-to-audio
10
- inference: true
11
- widget:
12
- - src: ./assets/demo_cfg_3_00000001.wav
13
- example_title: 'Unconditional (blank prompt)'
14
- parameters:
15
- negative_prompt: 'blurry, cropped, ugly'
16
- - text: 'Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye'
17
- parameters:
18
- negative_prompt: 'blurry, cropped, ugly'
19
- output:
20
- url: ./assets/music_3_illustration.jpg
21
- - text: 'Chill hip-hop beat, chillhop, lofi pop, favorite music'
22
- parameters:
23
- negative_prompt: 'blurry, cropped, ugly'
24
- output:
25
- url: ./assets/music_4_illustration.jpg
26
- ---
27
-
28
- <details>
29
- <summary>Comparison Table</summary>
30
- <table style="width:100%; border-collapse: collapse;">
31
- <colgroup>
32
- <col style="width: 25%;">
33
- <col style="width: 37.5%;">
34
- <col style="width: 37.5%;">
35
- </colgroup>
36
- <tr>
37
- <th>Prompt</th>
38
- <th>Base Model</th>
39
- <th>Fine-Tuned</th>
40
- </tr>
41
- <tr>
42
- <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
43
- Feel-Good Vibes and Dramatic Atmosphere, alone hero, epic, get good yeah, better last night pop, follow follow, echoing, powerful vocal driving melancholic vocals dramatic Features rising tension, progressive electro house, far away, by Alan Walker, popular song tempo, girl, female synth, popular, titled: legend never die
44
- </td>
45
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
46
- <audio controls style="width: 100%;">
47
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/1_Base_stable-audio-open-1.0.wav" type="audio/wav">
48
- </audio>
49
- </td>
50
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
51
- <audio controls style="width: 100%;">
52
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/1_1epoch.wav" type="audio/wav">
53
- </audio>
54
- </td>
55
- </tr>
56
- <tr>
57
- <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
58
- Beautiful music progressive electro slap mood, upbeat, heavy bass, melancholic, hopeful; drums, vocals, dynamic shifts, building intensity, run far away, repetitive, let let go, think of us, titled popular lyrics: Mirror's Edge, popular lyrics say: "still still alive"
59
- </td>
60
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
61
- <audio controls style="width: 100%;">
62
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/2_Base_stable-audio-open-1.0.wav" type="audio/wav">
63
- </audio>
64
- </td>
65
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
66
- <audio controls style="width: 100%;">
67
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/2_1epoch.wav" type="audio/wav">
68
- </audio>
69
- </td>
70
- </tr>
71
- <tr>
72
- <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
73
- Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye
74
- </td>
75
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
76
- <audio controls style="width: 100%;">
77
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/3_Base_stable-audio-open-1.0.wav" type="audio/wav">
78
- </audio>
79
- </td>
80
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
81
- <audio controls style="width: 100%;">
82
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/3_1epoch.wav" type="audio/wav">
83
- </audio>
84
- </td>
85
- </tr>
86
- <tr>
87
- <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
88
- Chill hip-hop beat, chillhop, lofi pop, favorite music
89
- </td>
90
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
91
- <audio controls style="width: 100%;">
92
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/4_Base_stable-audio-open-1.0.wav" type="audio/wav">
93
- </audio>
94
- </td>
95
- <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
96
- <audio controls style="width: 100%;">
97
- <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/4_1epoch.wav" type="audio/wav">
98
- </audio>
99
- </td>
100
- </tr>
101
- </table>
102
-
103
- <div>
104
- <Gallery />
105
- <div class="not-prose mb-2 flex flex-wrap items-start gap-4 sm:mr-6 sm:flex-row">
106
- <audio controls style="width: calc(50% - 8px);">
107
- <source src="./assets/3_1epoch.wav" type="audio/wav">
108
- </audio>
109
- <audio controls style="width: calc(50% - 8px);">
110
- <source src="./assets/4_1epoch.wav" type="audio/wav">
111
- </audio>
112
- </div>
113
- </div>
114
-
115
-
116
- <details open>
117
- <summary>Showcase Model Details</summary>
118
- <div>
119
- <h3>Test Settings:</h3>
120
- <ul>
121
- <li>CFG: 7.0</li>
122
- <li>Steps: 100</li>
123
- <li>Seed: -1</li>
124
- </ul>
125
- <p>Prompts were chosen from the top tagged words, except the last prompt, which is used to compare the effect on non-trained tags.</p>
126
- </div>
127
- </details>
128
- </details>
129
-
130
- <details>
131
- <summary>Training</summary>
132
-
133
- ### Dataset: 2-3 min music length
134
- - All of my liked music, [downloaded and auto-labeled](https://pastebin.com/z1bkZyqe), so mostly copyrighted.
135
- - Total number of samples: ~1383
136
- - `"random_crop": true` in [dataset_config.json](https://github.com/Stability-AI/stable-audio-tools/issues/99#issuecomment-2174885688)
137
-
138
- ### Settings:
139
- - Training epochs: 1
140
- - Training steps: 1383
141
- - Learning rate: 1e-05
142
-
143
- </details>
1
+ ---
2
+ language:
3
+ - en
4
+ library_name: stable-audio-tools
5
+ license: other
6
+ license_name: stable-audio-community
7
+ pipeline_tag: text-to-audio
8
+ tags:
9
+ - text-to-audio
10
+ inference: true
11
+ widget:
12
+ - src: ./assets/demo_cfg_3_00000001.wav
13
+ example_title: 'Unconditional (blank prompt)'
14
+ parameters:
15
+ negative_prompt: 'blurry, cropped, ugly'
16
+ - text: 'Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/music_3_illustration.jpg
21
+ - text: 'Chill hip-hop beat, chillhop, lofi pop, favorite music'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/music_4_illustration.jpg
26
+ ---
27
+
28
+
29
+
30
+ <style>
31
+ .spoiler{background:black;color:black;text-decoration:none!important}.spoiler a{color:black;text-decoration:underline}.spoiler:hover,.spoiler:hover a{color:white}
32
+ </style>
33
+
34
+ You can use this model with [stable-audio-tools](https://github.com/Stability-AI/stable-audio-tools). It is fine-tuned on my favorite songs <span class="spoiler">from my [personal playlist](https://www.youtube.com/watch?v=dQw4w9WgXcQ).</span>
35
+
36
+ <details>
37
+ <summary>Comparison Table</summary>
38
+ <table style="width:100%; border-collapse: collapse;">
39
+ <colgroup>
40
+ <col style="width: 25%;">
41
+ <col style="width: 37.5%;">
42
+ <col style="width: 37.5%;">
43
+ </colgroup>
44
+ <tr>
45
+ <th>Prompt</th>
46
+ <th>Base Model</th>
47
+ <th>Fine-Tuned</th>
48
+ </tr>
49
+ <tr>
50
+ <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
51
+ Feel-Good Vibes and Dramatic Atmosphere, alone hero, epic, get good yeah, better last night pop, follow follow, echoing, powerful vocal driving melancholic vocals dramatic Features rising tension, progressive electro house, far away, by Alan Walker, popular song tempo, girl, female synth, popular, titled: legend never die
52
+ </td>
53
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
54
+ <audio controls style="width: 100%;">
55
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/1_Base_stable-audio-open-1.0.wav" type="audio/wav">
56
+ </audio>
57
+ </td>
58
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
59
+ <audio controls style="width: 100%;">
60
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/1_1epoch.wav" type="audio/wav">
61
+ </audio>
62
+ </td>
63
+ </tr>
64
+ <tr>
65
+ <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
66
+ Beautiful music progressive electro slap mood, upbeat, heavy bass, melancholic, hopeful; drums, vocals, dynamic shifts, building intensity, run far away, repetitive, let let go, think of us, titled popular lyrics: Mirror's Edge, popular lyrics say: "still still alive"
67
+ </td>
68
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
69
+ <audio controls style="width: 100%;">
70
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/2_Base_stable-audio-open-1.0.wav" type="audio/wav">
71
+ </audio>
72
+ </td>
73
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
74
+ <audio controls style="width: 100%;">
75
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/2_1epoch.wav" type="audio/wav">
76
+ </audio>
77
+ </td>
78
+ </tr>
79
+ <tr>
80
+ <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
81
+ Chill soft wake up, slow down alt, night get lucky dance, relax music introspective 2017 2018 2019 2020 2021 2022, acoustic atmosphere uplifting dreams, dreamy indie pop, electric trap, percussion, higher reverb, really intensity melody, goodbye
82
+ </td>
83
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
84
+ <audio controls style="width: 100%;">
85
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/3_Base_stable-audio-open-1.0.wav" type="audio/wav">
86
+ </audio>
87
+ </td>
88
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
89
+ <audio controls style="width: 100%;">
90
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/3_1epoch.wav" type="audio/wav">
91
+ </audio>
92
+ </td>
93
+ </tr>
94
+ <tr>
95
+ <td style="font-size: smaller; padding: 0.5px; word-wrap: break-word;">
96
+ Chill hip-hop beat, chillhop, lofi pop, favorite music
97
+ </td>
98
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
99
+ <audio controls style="width: 100%;">
100
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/4_Base_stable-audio-open-1.0.wav" type="audio/wav">
101
+ </audio>
102
+ </td>
103
+ <td style="padding: 0.5px; vertical-align: middle; text-align: center;">
104
+ <audio controls style="width: 100%;">
105
+ <source src="https://huggingface.co/Nekochu/stable-audio-open-1.0-Music/resolve/main/assets/4_1epoch.wav" type="audio/wav">
106
+ </audio>
107
+ </td>
108
+ </tr>
109
+ </table>
110
+
111
+ <details open>
112
+ <summary>Showcase Model Details</summary>
113
+ <div>
114
+ <h3>Test Settings:</h3>
115
+ <ul>
116
+ <li>CFG: 7.0</li>
117
+ <li>Steps: 100</li>
118
+ <li>Seed: -1</li>
119
+ </ul>
120
+ <p>Prompts were chosen from the top tagged words, except the last prompt, which is used to compare the effect on non-trained tags.</p>
121
+ </div>
122
+
123
+ <div>
124
+ <Gallery />
125
+ <div class="not-prose mb-2 flex flex-wrap items-start gap-4 sm:mr-6 sm:flex-row">
126
+ <audio controls style="width: calc(50% - 8px);">
127
+ <source src="./assets/3_1epoch.wav" type="audio/wav">
128
+ </audio>
129
+ <audio controls style="width: calc(50% - 8px);">
130
+ <source src="./assets/4_1epoch.wav" type="audio/wav">
131
+ </audio>
132
+ </div>
133
+ </div>
134
+
135
+ </details>
136
+ </details>
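
The test settings listed above (CFG 7.0, 100 steps, seed -1) could be reproduced with something like the following sketch. It is an illustration under several assumptions, not the author's script: the call pattern follows the usage example published for the base `stabilityai/stable-audio-open-1.0` model, the `sigma_min`, `sigma_max`, and `sampler_type` values come from that example rather than from this README, and the helper names `build_conditioning` and `generate` are hypothetical. The default `model_id` points at the base model; whether the fine-tuned repository can be loaded the same way is not confirmed here.

```python
from typing import Any

# Values taken from the "Test Settings" list in the README.
SHOWCASE_SETTINGS = {"cfg_scale": 7.0, "steps": 100, "seed": -1}


def build_conditioning(prompt: str, seconds_total: float) -> list[dict[str, Any]]:
    """Build the text + timing conditioning list for a single clip."""
    return [{"prompt": prompt, "seconds_start": 0, "seconds_total": seconds_total}]


def generate(prompt: str, seconds_total: float = 30,
             model_id: str = "stabilityai/stable-audio-open-1.0"):
    """Generate one clip with the showcase settings (hypothetical helper)."""
    # Heavy imports are deferred so the helpers above work without a GPU stack.
    import torch
    from stable_audio_tools import get_pretrained_model
    from stable_audio_tools.inference.generation import generate_diffusion_cond

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model, model_config = get_pretrained_model(model_id)
    model = model.to(device)

    audio = generate_diffusion_cond(
        model,
        conditioning=build_conditioning(prompt, seconds_total),
        sample_size=model_config["sample_size"],
        # Sampler values below are assumptions copied from the base model's
        # published usage example; the README does not state them.
        sigma_min=0.3,
        sigma_max=500,
        sampler_type="dpmpp-3m-sde",
        device=device,
        **SHOWCASE_SETTINGS,
    )
    return audio, model_config["sample_rate"]
```

Calling `generate("Chill hip-hop beat, chillhop, lofi pop, favorite music")` would download the model weights on first use. With the seed set to -1, each run is expected to draw a fresh random seed, so outputs will differ between runs.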
137
+
138
+ <details>
139
+ <summary>Training</summary>
140
+
141
+ ### Dataset: 2-3 min music length
142
+ - All of my Liked music [download and auto label](https://pastebin.com/z1bkZyqe) so mostly copyright.
143
+ - Total number of samples: ~1383
144
+ - `"random_crop": true` in [dataset_config.json](https://github.com/Stability-AI/stable-audio-tools/issues/99#issuecomment-2174885688)
145
+
146
+ ### Settings:
147
+ - Training epochs: 1
148
+ - Training steps: 1383
149
+ - Learning rate: 1e-05
150
+
151
+ </details>
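
As a rough sketch of the training setup described above, the snippet below writes out a dataset config with `"random_crop": true` and checks the step count. The directory path, the dataset `id`, and the helper names are placeholders invented for illustration, and the `audio_dir` layout is assumed from the stable-audio-tools dataset documentation. The README does not state a batch size; 1383 steps for one epoch over roughly 1383 samples is simply consistent with a batch size of 1.

```python
import json
import math


def make_dataset_config(audio_dir: str, dataset_id: str = "liked_music") -> dict:
    """Return a dataset config dict with random cropping enabled."""
    return {
        "dataset_type": "audio_dir",  # assumed: a local folder of audio files
        "datasets": [{"id": dataset_id, "path": audio_dir}],
        "random_crop": True,  # the one setting the README calls out
    }


def steps_per_epoch(num_samples: int, batch_size: int) -> int:
    """Number of optimizer steps needed to see every sample once."""
    return math.ceil(num_samples / batch_size)


# Placeholder path; replace with the real dataset location.
config = make_dataset_config("/path/to/liked_music")
print(json.dumps(config, indent=4))

# ~1383 samples, 1 epoch, assumed batch size of 1
print(steps_per_epoch(1383, 1))  # → 1383
```

With `random_crop` enabled, each 2-3 minute track is expected to contribute a different window on each pass, which matters more when training for several epochs than for the single epoch reported here.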