quark_awq_g128_int4_asym_bf16_onnx_npu 1.3 • Collection • Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3) • 11 items • Updated Dec 14, 2024 • 1
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published 3 days ago • 254
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence • Paper • 2404.05892 • Published Apr 8, 2024 • 33