metadata
base_model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
library_name: peft
license: apache-2.0
pipeline_tag: text-generation
language:
- en
tags:
- unsloth
- grpo
- trl
- transformers
- qwen2.5
- text-generation-inference
- PyTorch
- gsm8k
Model Description
- Developed by: Jeesan Abbas
- License: Apache license 2.0
- Finetuned from model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
Uploaded model
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.