Maestro-10B


Model Information

Maestro-10B

suayptalha/Maestro-10B · base model: arcee-ai/Virtuoso-Lite · distilled from DeepSeek-V3 · 10B parameters

Base Model

Maestro-10B is a 10-billion-parameter model fine-tuned from Virtuoso-Lite, a next-generation language model developed by arcee-ai. Virtuoso-Lite is based on the Llama-3 architecture and was distilled from DeepSeek-V3 using approximately 1.1 billion tokens' worth of logits. This distillation allows Virtuoso-Lite to achieve robust performance at a smaller parameter count, excelling in reasoning, code generation, and mathematical problem-solving. Maestro-10B inherits these strengths and further enhances them through fine-tuning on the OpenOrca dataset. The combination of a distilled base model and targeted fine-tuning makes Maestro-10B a capable and efficient language model.
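
Since the card does not include usage instructions, the following is a minimal loading sketch, assuming the repository follows the standard transformers causal-LM layout (an FP16 safetensors checkpoint, as noted in the model details below). The prompt here is a plain string; if the tokenizer defines a chat template, `apply_chat_template` may be preferable.

```python
# Minimal usage sketch (assumption: standard transformers causal-LM layout).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "suayptalha/Maestro-10B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # checkpoint is stored in FP16
    device_map="auto",          # requires `accelerate`; places layers on available devices
)

prompt = "Explain the difference between distillation and fine-tuning."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```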

Loss Graph

[Training loss curve]
Model size: 10.3B params
Tensor type: FP16
Format: Safetensors
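
As a rough sizing check, at FP16 precision (2 bytes per parameter) the 10.3B-parameter checkpoint works out to roughly 20.6 GB of weights, before any runtime overhead such as the KV cache.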

Dataset used to train suayptalha/Maestro-10B: OpenOrca
