Ci Splunk PRO

Csplk

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago
Akshayram1/smol_agent
liked a Space 2 days ago
VIDraft/ChemGenesis
View all activity

Organizations

Blog-explorers's profile picture MetricLY's profile picture Hugging Face Discord Community's profile picture

Csplk's activity

reacted to hlarcher's post with πŸ”₯ 3 days ago
view post
Post
1004
We are introducing multi-backend support in Hugging Face Text Generation Inference!
With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned πŸ€— !

Check out the details: https://huggingface.co/blog/tgi-multi-backend