S P Sharan
Syzygianinfern0
AI & ML interests
LLMs, multimodal research, robotics
Recent Activity
upvoted
a
paper
1 day ago
s1: Simple test-time scaling
reacted
to
di-zhang-fdu's
post
with 👍
3 months ago
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/
What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤RLHF?
Just a little bite of strawberry!🍓
Past related works:
https://huggingface.co/papers/2410.02884
https://huggingface.co/papers/2406.07394
upvoted
a
paper
6 months ago
Transformer Explainer: Interactive Learning of Text-Generative Models
Organizations
Syzygianinfern0's activity
Is this an untouched vicuna weight?
2
#1 opened almost 2 years ago
by
Syzygianinfern0
Request: DOI
#4 opened about 2 years ago
by
Syzygianinfern0