ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 5 days ago • 46
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published 10 days ago • 40
marksverdhei/whisper-norwenglish-large-frankenmerge Automatic Speech Recognition • Updated Mar 8 • 12 • 2
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 25 days ago • 112
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Paper • 2503.21620 • Published 24 days ago • 59
MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving Paper • 2503.16905 • Published about 1 month ago • 54
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction Paper • 2503.16194 • Published about 1 month ago • 8