YigeYuan

1t4chi

AI & ML interests

None yet

Organizations

None yet

1t4chi's activity

updated a model 3 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

Updated 3 days ago • 19

published a model 3 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

Updated 3 days ago • 19

published a model 4 days ago

1t4chi/Qwen2.5-Math-7B-4GPU-Nothink-KL0.00005

Updated 4 days ago

published a model 6 days ago

1t4chi/Qwen2.5-Math-7B-HJX8k-4GPU-Nothink-KL0.0-FindData

Updated 6 days ago

published a model 21 days ago

1t4chi/mistral-7b-base-simper

Updated 21 days ago

liked a Space 3 months ago

345

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data

liked a model 3 months ago

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • Updated May 23, 2024 • 180 • 3

liked 2 models 4 months ago

allenai/tulu-v2.5-dpo-13b-hh-rlhf

Text Generation • Updated Jun 14, 2024 • 17 • 1

allenai/tulu-2-dpo-13b

Text Generation • Updated May 17, 2024 • 3.99k • 20

liked a model 5 months ago

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9, 2024 • 418 • 10

liked 3 datasets 5 months ago

liked 2 models 5 months ago

ChenmieNLP/Zephyr-7B-Beta-Helpful

Text Generation • Updated Oct 10, 2024 • 38 • 1

HelpingAI/HelpingAI-9B

Text Generation • Updated Oct 31, 2024 • 252 • 25

liked 2 datasets 6 months ago

rngusry/UltraFeedback-honesty-preferences

Viewer • Updated Aug 3, 2024 • 251k • 95 • 1

rngusry/UltraFeedback-truthfulness-preferences

Viewer • Updated Jul 25, 2024 • 217k • 143 • 1

updated 3 datasets 6 months ago

1t4chi/ultrafeedback-binarized-processed

Viewer • Updated Sep 13, 2024 • 63.1k • 46

1t4chi/hh-rlhf-harmless-processed

Viewer • Updated Sep 13, 2024 • 44.8k • 35

1t4chi/hh-rlhf-helpful-processed

Viewer • Updated Sep 13, 2024 • 46.2k • 42