--- language: - en base_model: - arcee-ai/Virtuoso-Lite datasets: - Open-Orca/OpenOrca pipeline_tag: text-generation library_name: transformers tags: - unsloth - trl - sft --- Maestro-10B

Maestro-10B

Model banner
Created by suayptalha

Model Information

Maestro-10B

suayptalha/Maestro-10B arcee-ai/Virtuoso-Lite DeepSeek-V3 10b Parameters

Base Model

Maestro-10B is a 10 billion parameter model fine-tuned from Virtuoso-Lite, a next-generation language model developed by arcee-ai. Virtuoso-Lite itself is based on the Llama-3 architecture, distilled from Deepseek-v3 using approximately 1.1 billion tokens/logits. This distillation process allows Virtuoso-Lite to achieve robust performance with a smaller parameter count, excelling in reasoning, code generation, and mathematical problem-solving. Maestro-10B inherits these strengths from its base model, Virtuoso-Lite, and further enhances them through fine-tuning on the OpenOrca dataset. This combination of a distilled base model and targeted fine-tuning makes Maestro-10B a powerful and efficient language model.

Loss Graph

Model banner