GGUF
Inference Endpoints

Commit History

Upload imatrix.dat
72d3578
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
7a881e3
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
1e7170f
verified

ThomasBaruzier commited on