Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
prithivMLmods
/
Bellatrix-Tiny-1B-R1
like
7
Text Generation
Transformers
Safetensors
English
llama
GRPO
Reinforcement learning
trl
SFT
conversational
text-generation-inference
Inference Endpoints
License:
llama3.2
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Bellatrix-Tiny-1B-R1
/
README.md
Commit History
Update README.md
40ce38e
verified
prithivMLmods
commited on
12 days ago
Update README.md
f793075
verified
prithivMLmods
commited on
12 days ago
Update README.md
2365c22
verified
prithivMLmods
commited on
13 days ago
initial commit
6620ebe
verified
prithivMLmods
commited on
14 days ago