yilong xu's picture

yilong xu

sapphirex

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

updated a dataset 19 days ago

sapphirex/lucene-msmarcov2.1

published a dataset 19 days ago

sapphirex/lucene-msmarcov2.1

View all activity

Organizations

upvoted a paper 4 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published 23 days ago • 21

upvoted a paper 21 days ago

MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling

Paper • 2602.03359 • Published 24 days ago • 9

upvoted a collection 4 months ago

Annotation-Efficient Universal Honesty Alignment

Official Collections of paper "Annotation-Efficient Universal Honesty Alignment". • 5 items • Updated Oct 21, 2025 • 3

upvoted a paper 4 months ago

Annotation-Efficient Universal Honesty Alignment

Paper • 2510.17509 • Published Oct 20, 2025 • 22

upvoted 2 papers 7 months ago

Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models

Paper • 2504.00573 • Published Apr 1, 2025 • 2

RAVine: Reality-Aligned Evaluation for Agentic Search

Paper • 2507.16725 • Published Jul 22, 2025 • 31

upvoted a collection 7 months ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 16 days ago • 84

upvoted a paper 8 months ago

RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Paper • 2507.03253 • Published Jul 4, 2025 • 19

upvoted a collection over 1 year ago

BGE

31 items • Updated 23 days ago • 146