meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 2 days ago • 101k • 630
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback Paper • 2503.22230 • Published 12 days ago • 43
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published 26 days ago • 27
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • 28 days ago • 378
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations Paper • 2410.10315 • Published Oct 14, 2024 • 3
Regularizing Neural Networks via Adversarial Model Perturbation Paper • 2010.04925 • Published Oct 10, 2020
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21 • 57