
A fine-tuned multilingual model for Vietnamese reasoning
NOTE
- This model is a test version for our new training pipeline. THIS IS NOT SUITABLE FOR PRODUCTION
- The full model will be updated soon.
π Overview
- A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
- The model follows a structured XML format with explicit reasoning tags.
- It's designed for educational applications and complex problem-solving tasks in Vietnamese.
π§ Method
- Fine-tuned using Group Relative Policy Optimization (GRPO) with Unsloth for hardware efficiency.
- Employs rule-based reward functions to encourage adherence to Vietnamese XML reasoning format.
- Uses LoRA adaptation on a Vietnamese dataset spanning various task types.
π« Quantization
Coming Soon!
π€ Contributors
Developed with β€οΈ by BlossomAI
Star βοΈ this repo if you find it valuable!
- Downloads last month
- 44
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for BlossomsAI/BloomVN-8B-Chat-Reasoning
Base model
Qwen/Qwen2.5-7B
Finetuned
sail/Sailor2-8B
Finetuned
sail/Sailor2-8B-Chat
Finetuned
BlossomsAI/BloomVN-8B-chat