Logo

🌟 BloomVN-8B-Chat-Reasoning

A fine-tuned multilingual model for Vietnamese reasoning

NOTE

  • This model is a test version for our new training pipeline. THIS IS NOT SUITABLE FOR PRODUCTION
  • The full model will be updated soon.

πŸ“‹ Overview

  • A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
  • The model follows a structured XML format with explicit reasoning tags.
  • It's designed for educational applications and complex problem-solving tasks in Vietnamese.

πŸ”§ Method

  • Fine-tuned using Group Relative Policy Optimization (GRPO) with Unsloth for hardware efficiency.
  • Employs rule-based reward functions to encourage adherence to Vietnamese XML reasoning format.
  • Uses LoRA adaptation on a Vietnamese dataset spanning various task types.

πŸ’« Quantization

Coming Soon!

🀝 Contributors

Developed with ❀️ by BlossomAI


Star ⭐️ this repo if you find it valuable!
Downloads last month
44
Safetensors
Model size
8.55B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for BlossomsAI/BloomVN-8B-Chat-Reasoning

Base model

Qwen/Qwen2.5-7B
Finetuned
sail/Sailor2-8B
Finetuned
(1)
this model
Quantizations
1 model