Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Akhil Theerthala
Akhil-Theerthala
Follow
0 followers
·
2 following
akhil-theerthala
AI & ML interests
None yet
Recent Activity
replied
to
burtenshaw
's
post
1 day ago
Here’s a notebook to make Gemma reason with GRPO & TRL. I made this whilst prepping the next unit of the reasoning course: In this notebooks I combine together google’s model with some community tooling - First, I load the model from the Hugging Face hub with transformers’s latest release for Gemma 3 - I use PEFT and bitsandbytes to get it running on Colab - Then, I took Will Browns processing and reward functions to make reasoning chains from GSM8k - Finally, I used TRL’s GRPOTrainer to train the model Next step is to bring Unsloth AI in, then ship it in the reasoning course. Links to notebook below. https://colab.research.google.com/drive/1Vkl69ytCS3bvOtV9_stRETMthlQXR4wX?usp=sharing
updated
a dataset
1 day ago
Akhil-Theerthala/Personal-Finance-Queries
published
a dataset
8 days ago
Akhil-Theerthala/Personal-Finance-Queries
View all activity
Organizations
None yet
models
None public yet
datasets
2
Sort: Recently updated
Akhil-Theerthala/Personal-Finance-Queries
Preview
•
Updated
1 day ago
•
62
Akhil-Theerthala/PersonalFinance-CoTR-5K
Viewer
•
Updated
25 days ago
•
5.02k
•
77
•
1