NanoR1 / README.md
devZeeshaan's picture
Update README.md
49ab6e8 verified
metadata
base_model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
library_name: peft
license: apache-2.0
pipeline_tag: text-generation
language:
  - en
tags:
  - unsloth
  - grpo
  - trl
  - transformers
  - qwen2.5
  - text-generation-inference
  - PyTorch
  - gsm8k

Model Description

  • Developed by: Jeesan Abbas
  • License: Apache license 2.0
  • Finetuned from model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit

Uploaded model

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.