AMD RyzenAI Models mohitsha/timm-resnet18-onnx-quantized-ryzen Updated Mar 21 mohitsha/transformers-resnet18-onnx-quantized-ryzen Image Classification • Updated Mar 21 • 37 mohitsha/Llama-2-7b-hf-quantized-brevitas Updated Mar 27 mohitsha/opt-125m-quantized-brevitas Text Generation • Updated Mar 27 • 12
FP8 KV Cache Models with FP8 KV Cache Scales mohitsha/Llama-2-70b-chat-hf-FP8-KV Text Generation • Updated Jun 25 • 18 mohitsha/Llama-2-7b-chat-hf-FP8-KV Text Generation • Updated Jun 25 • 25 mohitsha/Llama-2-7b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25 • 22 mohitsha/Llama-2-70b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25 • 20