Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 10 items • Updated 7 days ago • 7
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated 16 days ago • 44 • 3
Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 10 items • Updated 7 days ago • 7
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24, 2024 • 10
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 14, 2024 • 115 • 12
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 14, 2024 • 115 • 12
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2.5 Text Generation • Updated Nov 11, 2024 • 565 • 4
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3 Text Generation • Updated Nov 11, 2024 • 365 • 5
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2 Text Generation • Updated Nov 11, 2024 • 15 • 3
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 11, 2024 • 151 • 2