Dionyssos commited on
Commit
5ffcd95
·
1 Parent(s): 3ac9f34

draft: audiobook

Browse files
Files changed (1) hide show
  1. README.md +16 -2
README.md CHANGED
@@ -50,7 +50,7 @@ pip install -r requirements.txt
50
 
51
  </details>
52
 
53
- Start Flask
54
 
55
  ```
56
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=./hf_home CUDA_VISIBLE_DEVICES=2 python api.py
@@ -96,7 +96,7 @@ For SHIFT demo / Collaboration with [SMB](https://www.smb.museum/home/)
96
 
97
  # Live Demo - Paplay
98
 
99
- Flask
100
 
101
  ```python
102
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=/data/dkounadis/.hf7/ CUDA_VISIBLE_DEVICES=4 python live_api.py
@@ -113,3 +113,17 @@ python live_demo.py # will ask text input & play soundscape
113
  ```python
114
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=/data/dkounadis/.hf7/ CUDA_VISIBLE_DEVICES=4 python demo.py
115
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  </details>
52
 
53
+ Flask API
54
 
55
  ```
56
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=./hf_home CUDA_VISIBLE_DEVICES=2 python api.py
 
96
 
97
  # Live Demo - Paplay
98
 
99
+ Special Flask API for playing sounds live
100
 
101
  ```python
102
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=/data/dkounadis/.hf7/ CUDA_VISIBLE_DEVICES=4 python live_api.py
 
113
  ```python
114
  CUDA_DEVICE_ORDER=PCI_BUS_ID HF_HOME=/data/dkounadis/.hf7/ CUDA_VISIBLE_DEVICES=4 python demo.py
115
  ```
116
+
117
+ # AudioBook
118
+
119
+ Convert your `.docx` to audio `.wav`. Via multiple voices, then concatenate all `audiobooks.wav` made with each voice to a full one
120
+ `concatenate audiobook has noisy speech, the individual single-voice audiobooks are clean, some issue with ffmpeg`. Therefore, for now, SHIFT repo only produces
121
+ single-voice audiobook. Archiving the multiple-voice `audiobook.py` here.
122
+
123
+ ```python
124
+ # uses Flask api.py
125
+ # needs to load ../shift/assets/INCLUSION_IN_MUSEUMS_audiobook.docx
126
+ #
127
+ #
128
+ python audiobook.py
129
+ ```