Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Paper • 2406.05955 • Published Jun 10, 2024 • 24
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2 Text Generation • Updated Dec 18, 2024 • 798 • 15