Quasar Series of Models
Introducing Quasar-3.3-Max
This model is provided by SILX INC. It has been supervised fine-tuned using the open-r1 repository. The training data includes sequences of varying lengths (32k, 16k, and 8k) to enhance the model's knowledge and adaptability.
Quasar-3.3-Max represents the first step in the Quasar project before Reinforcement Learning (RL). At this stage, the model's reasoning steps are capped at a maximum length of 8129 tokens to optimize processing efficiency and contextual understanding.
Stay tuned for further updates as we advance the Quasar project with RL enhancements!
Resources
Founders
- Eyad Gomaa
- Gomaa Salah
- Downloads last month
- 204
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.