I’m trying to replicate this blog post on fine-tuning XLSR (Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers) and I’m running into CUDA out-of-memory errors. I’m training on a machine with multiple NVIDIA Titan V GPUs (12 GB memory each), and even when I:
- reduce batch size to 1
- remove all clips longer than 5 seconds (I even lowered this threshold to 2 seconds)
- use Adafactor instead of AdamW (as suggested here: Performance and Scalability: How To Fit a Bigger Model and Train It Faster)
I still run out of memory. I’m not sure whether this points to a bug in my code or whether I simply don’t have enough GPU memory for this model; any advice would be appreciated!
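
In case it helps to see concretely what I’m doing, here is a minimal sketch of how I’m applying those three changes. It assumes the blog post’s preprocessing (each example has an `input_values` column holding the raw 16 kHz waveform) and a transformers version recent enough that `TrainingArguments` accepts `optim="adafactor"`; the helper name and output path are just placeholders:

```python
from transformers import TrainingArguments

SAMPLING_RATE = 16_000   # XLSR-Wav2Vec2 expects 16 kHz audio
MAX_SECONDS = 5.0        # I also tried 2.0

def drop_long_clips(dataset, max_seconds=MAX_SECONDS):
    # Assumes each example has an "input_values" column with the raw
    # waveform, as produced by the blog post's preprocessing step.
    max_len = int(max_seconds * SAMPLING_RATE)
    return dataset.filter(lambda ex: len(ex["input_values"]) < max_len)

training_args = TrainingArguments(
    output_dir="./wav2vec2-large-xlsr-demo",  # placeholder path
    per_device_train_batch_size=1,            # already at the minimum
    optim="adafactor",                        # Adafactor instead of AdamW
    group_by_length=True,                     # batch similar-length clips, as in the blog post
)
```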