I’m using Colab Pro to try out a few Hugging Face models, but even loading a 2 GB model completely fills up the GPU memory (16 GB Tesla), and then as the data loads I frequently run out of memory. Can anyone explain why such a small model would max out the GPU memory?
My dataset is about 1 GB between train and test, if that makes a difference.
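Here is roughly how I’m loading the model, in case it helps (just a sketch; the model name below is a placeholder, not the exact checkpoint I’m using):

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "bert-base-uncased"  # placeholder for the ~2 GB model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).to("cuda")

# The weights alone should only be a fraction of the 16 GB; the rest
# gets eaten later by gradients, optimizer states, and activations.
print(f"allocated after load: {torch.cuda.memory_allocated() / 1e9:.2f} GB")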
Can you share the parameters you are training with, i.e. batch size, epochs, etc.? I have run into memory issues with a 1 GB model; 2 GB is actually pretty big when training.
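In the meantime, the usual knobs are a smaller batch size, gradient accumulation, and fp16. With the Trainer API that would look roughly like this (all values are placeholders, tune them for your run):

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=4,   # smaller batches cut activation memory a lot
    gradient_accumulation_steps=8,   # keeps the effective batch size at 32
    fp16=True,                       # half precision roughly halves activation/gradient memory
    num_train_epochs=3,
)

Most of the memory during training usually goes to activations, gradients, and optimizer states rather than the weights themselves, which is why a 2 GB model can still OOM on a 16 GB card.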
To check GPU resources in Google Colab you can try something like this:
# On the left side you can open a Terminal ('>_' icon with a black background).
# You can run commands from there even while a cell is running.
# This command shows GPU usage in real time:
$ watch nvidia-smi
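If you would rather check from inside a notebook cell, running !nvidia-smi in a cell works too, or something along these lines in Python (just a sketch):

import torch

free, total = torch.cuda.mem_get_info()   # bytes free / total on the current device
print(f"free: {free / 1e9:.2f} GB of {total / 1e9:.2f} GB")
print(f"allocated by PyTorch: {torch.cuda.memory_allocated() / 1e9:.2f} GB")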