GGUF
Inference Endpoints
ThomasBaruzier's picture
Upload Llama-3.1-Minitron-4B-Width-Base-Q8_0.gguf
6585cfd verified