Exceeded GPU quota

What did you say I can do to avoid paid plans? By quickly switching VPNs and proxies?

Basically, that would be the main method. However, keep in mind that countermeasures may be introduced in the future.
I'm not familiar with cloud services myself, but I've heard that when you restart a process on a cloud service, the IP address usually changes at the same time, so no particular tool seems to be needed.
So I don’t have any suggestions for special tools in this case.
I do know about old-fashioned underground tools, up to and including botnets, from an educational standpoint, but… you know about those as well, and I don’t think I can recommend them.

My software is new, and initially I want to test it with a few users first rather than use Inference Endpoints.

If it is a well-known model, the Serverless Inference API should still be available. I’ll go look for a link now.

There is a list of volunteers.

The HF Hub was updated yesterday, and it appears that it is now possible to check whether inference is available for a given model.

If the status is warm, the model can be used as before.
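
For anyone who wants to check the status from a script instead of the model page, here is a rough sketch. It assumes the Hub's public model API accepts an `expand[]=inference` query parameter and returns an `inference` field in the JSON; I have not verified that against the current docs, so treat the URL and field name as assumptions. The function names are just ones I made up for illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumed base URL of the Hub's public model API.
HUB_API = "https://huggingface.co/api/models"


def build_status_url(repo_id: str) -> str:
    """Build the URL that asks the Hub for a model's inference status.

    Assumption: the API accepts `expand[]=inference`.
    """
    query = urllib.parse.urlencode({"expand[]": "inference"})
    return f"{HUB_API}/{repo_id}?{query}"


def is_usable(status) -> bool:
    """Only "warm" is treated as ready to use; anything else is not."""
    return status == "warm"


def fetch_inference_status(repo_id: str, timeout: float = 10.0):
    """Return the `inference` field for a model, or None if it is absent.

    Needs network access; raises urllib.error.HTTPError for unknown repos.
    """
    with urllib.request.urlopen(build_status_url(repo_id), timeout=timeout) as resp:
        payload = json.load(resp)
    return payload.get("inference")
```

You would call `is_usable(fetch_inference_status("some-org/some-model"))` with your own model id (the one here is a placeholder). If it returns `False`, the model is probably not loaded on the serverless side right now.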