4 16

Qiyuan Zhang

DonJoey

AI & ML interests

None yet

Recent Activity

authored a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

upvoted a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

commented on a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

View all activity

Organizations

None yet

DonJoey's activity

authored a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Paper • 2502.12501 • Published 19 days ago • 6

upvoted a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Paper • 2502.12501 • Published 19 days ago • 6

commented a paper 18 days ago

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Paper • 2502.12501 • Published 19 days ago • 6 •

upvoted a paper about 2 months ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published Jan 22 • 24

authored a paper 2 months ago

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published Dec 21, 2024 • 8

upvoted a collection 2 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 26 days ago • 74

upvoted 2 papers 2 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 7

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 86

upvoted a paper 3 months ago

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published Dec 21, 2024 • 8

commented a paper 3 months ago

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published Dec 21, 2024 • 8 •

upvoted 5 papers 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

Reliable, Reproducible, and Really Fast Leaderboards with Evalica

Paper • 2412.11314 • Published Dec 15, 2024 • 2

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 46

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 39

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 59

upvoted 4 papers 5 months ago

commented a paper 5 months ago

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Paper • 2410.05193 • Published Oct 7, 2024 • 13 •