VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Paper • 2504.05118 • Published 1 day ago • 15
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • Updated about 10 hours ago • 12.4k • • 246
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper • 2503.24290 • Published 9 days ago • 60
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Paper • 2503.21729 • Published 13 days ago • 27
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 14 days ago • 104