OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond Paper • 2605.19660 • Published 3 days ago • 32
ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions Paper • 2605.20087 • Published 3 days ago • 10
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Paper • 2408.12168 • Published Aug 22, 2024
ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting Paper • 2406.19976 • Published Jun 28, 2024
R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization Paper • 2505.15155 • Published May 21, 2025 • 1
R&D-Agent: An LLM-Agent Framework Towards Autonomous Data Science Paper • 2505.14738 • Published May 20, 2025 • 1
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL Paper • 2605.18703 • Published 4 days ago • 44
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL Paper • 2605.18703 • Published 4 days ago • 44
EnvFactory Collection This is the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL. • 7 items • Updated 1 day ago • 1
EnvFactory Collection This is the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL. • 7 items • Updated 1 day ago • 1
EnvFactory Collection This is the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL. • 7 items • Updated 1 day ago • 1