Hello,
Can anyone recommend a cost-effective platform to deploy our RAG model and make it available as an API for ongoing use?
Hello,
Can anyone recommend a cost-effective platform to deploy our RAG model and make it available as an API for ongoing use?
Would it be something like this?
Yes, I’m implementing this, but instead of deploying it as a web app or chatbot, I want to use it purely as an API.
If so, the cheapest way is to deploy it as a web app and use it from the Gradio client.
The Endpoint API is probably faster and smarter, but there is a fee.
can we make an API using LangServe and use that API in c# code …???
Gradio can be used from curl, for example, so of course you can get by with C#…
About Endpoint, I’ve never used Endpoint, but maybe you can?
Yes, Thanks
But do you have any idea about Lang Serve using which we can create an Fast API and also use it as an REST API, actually i am not sure about this want some guidance on this
I don’t know anything about Lang Serve, but from my search, it seems to be intended for use with Cloud Run and the like. Or maybe Amazon AWS?
I’m not familiar with cloud services…
As for FastAPI, the backend of Gradio should also be FastAPI, so it should be basically the same for HF.
I hope this will give some extra information:
Thanks, this information helps me a lot.
This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.