Collections

Collections including paper arxiv:2504.00050

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 30
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107
Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 27
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 101

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 17 days ago • 57

🤔 Reasoning about Reasoning

papers and articles about reasoning LLMs

about 12 hours ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 20 days ago • 39
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 16 days ago • 61
JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 17 days ago • 57
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published 9 days ago • 77

Benchmark and Evaluation

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 72
Benchmarking LLMs for Political Science: A United Nations Perspective

Paper • 2502.14122 • Published Feb 19 • 2
IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 20
ExpertGenQA: Open-ended QA generation in Specialized Domains

Paper • 2503.02948 • Published Mar 4

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275
Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114
START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 108
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 23 days ago • 29

2025 LLM Papers on Hugging Face with Japanese Memos

about 7 hours ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published Jan 16 • 29
