Alfaxad Eyembe

Alfaxad

AI & ML interests

AI, Robotics

Recent Activity

Organizations

None yet

Alfaxad's activity

upvoted an article 1 day ago
view article
Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

31
New activity in Alfaxad/gemma2-27b-swahili-it 4 days ago

More datasets

4
#1 opened 18 days ago by
Skier8402
reacted to Jaward's post with 🔥🧠❤️ 24 days ago
view post
Post
3867
Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb
  • 2 replies
·