How to use Inference API (serverless) in my model page?

LimYeri · June 3, 2024, 1:46pm

Hi All.
I have uploaded my fine-tuned model to Hugging Face. I want to create an inference API (serverless) on the model page, but a timeout occurs. What should I do?

Here is how I wrote the README:
pipeline_tag: text-generation
inference:
parameters:
max_new_tokens: 300
stop:
- <|end_of_text|>
- <|eot_id|>

kayrab · June 28, 2024, 5:53am

Screenshot from 2024-06-28 08-52-33
I am having a similar problem

Topic		Replies	Views
Inference API timeout Site Feedback	0	167	May 29, 2024
Deploying to Model Hub for Inference with custom tokenizer Beginners	1	612	January 1, 2022
Model loading always times out? Beginners	0	124	August 19, 2024
Serverless Inference API error on new model Inference Endpoints on the Hub	5	197	September 9, 2024
How to configure a model for Inference API? Models	0	336	May 23, 2024

How to use Inference API (serverless) in my model page?

Related topics