seyf EL

seyf1elislam

AI & ML interests

Time series forecasting, LLMs (fine-tunes, merges, quantization...)

Recent Activity

liked a model 2 days ago
microsoft/phi-4
liked a model 15 days ago
google/gemma-scope
liked a model 15 days ago
Qwen/QVQ-72B-Preview

Organizations

ZeroGPU Explorers, Social Post Explorers, Hugging Face Discord Community

seyf1elislam's activity

reacted to AdinaY's post with 👀 about 2 months ago
reacted to di-zhang-fdu's post with 👍 2 months ago
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS ❀ LLM ❀ Self-Play ❀ RLHF?
Just a little bite of strawberry! 🍓

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
  • 2 replies