Pritam Kumar Ravi's picture

Pritam Kumar Ravi

PritamcodesAGI

AI & ML interests

LLM for security , Computational Neuroscience.

Recent Activity

Organizations

MLX Community's profile picture ONNX Community's profile picture Hugging Face Discord Community's profile picture

PritamcodesAGI's activity

reacted to hesamation's post with ❤️ 14 days ago
reacted to burtenshaw's post with 🤗 about 1 month ago
view post
Post
3802
I’m super excited to work with @mlabonne to build the first practical example in the reasoning course.

🔗 reasoning-course

Here's a quick walk through of the first drop of material that works toward the use case:

- a fundamental introduction to reinforcement learning. Answering questions like, ‘what is a reward?’ and ‘how do we create an environment for a language model?’

- Then it focuses on Deepseek R1 by walking through the paper and highlighting key aspects. This is an old school way to learn ML topics, but it always works.

- Next, it takes to you Transformers Reinforcement Learning and demonstrates potential reward functions you could use. This is cool because it uses Marimo notebooks to visualise the reward.

- Finally, Maxime walks us through a real training notebook that uses GRPO to reduce generation length. I’m really into this because it works and Maxime took the time to validate it share assets and logging from his own runs for you to compare with.

Maxime’s work and notebooks have been a major part of the open source community over the last few years. I, like everyone, have learnt so much from them.
upvoted an article about 1 month ago
view article
Article

Visualize and understand GPU memory in PyTorch

213
upvoted an article about 2 months ago
view article
Article

FastRTC: The Real-Time Communication Library for Python

154