yenson-lau
's Collections
Starred
updated
Pass@k Training for Adaptively Balancing Exploration and Exploitation of
Large Reasoning Models
Paper
•
2508.10751
•
Published
•
28
Reinforcement Pre-Training
Paper
•
2506.08007
•
Published
•
263
MCP-Universe: Benchmarking Large Language Models with Real-World Model
Context Protocol Servers
Paper
•
2508.14704
•
Published
•
43
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
•
2508.16153
•
Published
•
160
AgentScope 1.0: A Developer-Centric Framework for Building Agentic
Applications
Paper
•
2508.16279
•
Published
•
53
Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the
effect of Epistemic Markers on LLM-based Evaluation
Paper
•
2410.20774
•
Published
Provable Benefits of In-Tool Learning for Large Language Models
Paper
•
2508.20755
•
Published
•
11
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI
Agents
Paper
•
2509.06917
•
Published
•
41
RLP: Reinforcement as a Pretraining Objective
Paper
•
2510.01265
•
Published
•
40
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
•
2508.03680
•
Published
•
122
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper
•
2510.16872
•
Published
•
106
Scaling Latent Reasoning via Looped Language Models
Paper
•
2510.25741
•
Published
•
221
Emu3.5: Native Multimodal Models are World Learners
Paper
•
2510.26583
•
Published
•
108