CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models Paper • 2602.17684 • Published 23 days ago • 21
MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling Paper • 2602.03359 • Published 24 days ago • 9
Annotation-Efficient Universal Honesty Alignment Collection Official Collections of paper "Annotation-Efficient Universal Honesty Alignment". • 5 items • Updated Oct 21, 2025 • 3
Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models Paper • 2504.00573 • Published Apr 1, 2025 • 2
RAVine: Reality-Aligned Evaluation for Agentic Search Paper • 2507.16725 • Published Jul 22, 2025 • 31
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 16 days ago • 84
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs Paper • 2507.03253 • Published Jul 4, 2025 • 19