Deploy an ML model and make it accessible as an API

riddhi810 · October 28, 2024, 9:05am

Hello,

Can anyone recommend a cost-effective platform to deploy our RAG model and make it available as an API for ongoing use?

John6666 · October 28, 2024, 11:35am

Would it be something like this?

riddhi810 · October 28, 2024, 12:10pm

Yes, I’m implementing this, but instead of deploying it as a web app or chatbot, I want to use it purely as an API.

John6666 · October 28, 2024, 12:14pm

If so, the cheapest way is to deploy it as a web app and call it through the Gradio client.
The Endpoint API is probably faster and more capable, but it is a paid service.

riddhi810 · October 28, 2024, 12:18pm

Can we build an API with LangServe and call that API from C# code?

John6666 · October 28, 2024, 12:20pm

A Gradio app can be called with curl, for example, so of course you can call it from C# as well.
As for Endpoints, I've never used them, but they can probably be called the same way.
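
To make the "usable from curl" point concrete, here is a minimal sketch of the HTTP request a Gradio app accepts, written with Python's standard library only. It assumes the `POST <base_url>/call/<api_name>` route with a `{"data": [...]}` JSON body; newer Gradio versions may prefix the route with `/gradio_api`, so confirm it on the app's "Use via API" page. The Space URL and the `predict` endpoint name are hypothetical.

```python
import json
import urllib.request


def build_gradio_call(base_url: str, api_name: str, inputs: list) -> urllib.request.Request:
    """Build the POST request that starts a prediction on a Gradio app."""
    # Assumed route: "<base_url>/call/<api_name>"; check the app's
    # "Use via API" page, since the prefix can differ between versions.
    url = f"{base_url.rstrip('/')}/call/{api_name.lstrip('/')}"
    body = json.dumps({"data": inputs}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def start_prediction(request: urllib.request.Request) -> str:
    """Send the request and return the event id used to fetch the result."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["event_id"]


if __name__ == "__main__":
    # Hypothetical Space URL and endpoint name, for illustration only.
    req = build_gradio_call(
        "https://your-user-your-space.hf.space", "predict", ["What is RAG?"]
    )
    print(req.full_url, req.data.decode("utf-8"))
```

Because this is plain JSON over HTTP, the same request can be sent from C# with `HttpClient`. Under the same assumption about the route, the result is then read with a GET on `<base_url>/call/<api_name>/<event_id>`.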

riddhi810 · October 28, 2024, 12:25pm

Yes, thanks.

Do you know anything about LangServe? As I understand it, we can use it to build a FastAPI app and expose it as a REST API, but I'm not sure about this and would appreciate some guidance.

John6666 · October 28, 2024, 12:32pm

I don't know anything about LangServe, but from a quick search it seems to be intended for use with Cloud Run and similar services. Or maybe AWS?
I'm not familiar with cloud services...
As for FastAPI, Gradio's backend should also be FastAPI, so it should work basically the same way on HF.

mahmutc · October 28, 2024, 12:37pm

I hope this gives you some extra information:

riddhi810 · October 28, 2024, 12:37pm

Thanks, this information helps me a lot.

system · October 29, 2024, 12:38am

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.
