neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 61 • 1
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 21
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 218 • 1
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 218 • 1
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 Text Generation • Updated about 1 month ago • 52
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated about 1 month ago • 185 • 3
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 61 • 1
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • Updated about 1 month ago • 21
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16 Text Generation • Updated about 1 month ago • 17
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • Updated Dec 17, 2024 • 6.58k • 23
Sparse-Llama-3.1-2of4 Collection 2:4 sparse versions of Llama-3.1, including transfer learning • 10 items • Updated Dec 18, 2024 • 4