Maxime Labonne PRO

mlabonne

AI & ML interests

Post-training, model editing, quantization

Organizations

Blog-explorers, Qwen, ZeroGPU Explorers, Merge Crew, Social Post Explorers, Liquid AI, gg-tt, rg-preview, Hugging Face Reasoning Course, gg-hf-g

mlabonne's activity

commented on Uncensor any LLM with abliteration 2 days ago
New activity in mlabonne/gemma-3-27b-it-abliterated-GGUF 6 days ago

Need mmproj file

5
#2 opened 7 days ago by notmebug
reacted to burtenshaw's post with 🚀❤️🤗 7 days ago
NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.

🔗 reasoning-course

This unit is super useful if you’re tuning models with reinforcement learning. It will help with:

- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions

This unit also builds smoothly toward the existing practical exercises from @mlabonne and Unsloth.

📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.