Alifian candra
Alian95
AI & ML interests
None yet
Recent Activity
replied to
burtenshaw's
post
15 days ago
NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.
🔗 https://huggingface.co/reasoning-course
This unit is super useful if you’re tuning models with reinforcement learning. It will help with:
- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions
This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.
📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.
new activity
15 days ago
huggingface/InferenceSupport:Alian95/Alian95
published
a model
15 days ago
Alian95/Alian95
Organizations
None yet
Alian95's activity
Alian95/Alian95
1
#132 opened 15 days ago
by
Alian95

Alian95/Alian95
1
#132 opened 15 days ago
by
Alian95
