arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests
Large Language Model, Modality Reasoning and their evaluation
Recent Activity
updated
a Space
about 20 hours ago
TIGER-Lab/GenAI-Arena
updated
a dataset
1 day ago
CodeDPO/codedpo_20241208_openrlhf_format_hard
published
a dataset
1 day ago
CodeDPO/codedpo_20241208_openrlhf_format_hard
Organizations
Papers
10
models
38
DongfuJiang/Qwen2-VL-VAE-7B-Instruct
Image-Text-to-Text
•
Updated
•
23
DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae
Text2Text Generation
•
Updated
•
5
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt
Text Generation
•
Updated
•
11
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft
Text Generation
•
Updated
•
6
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt
Text Generation
•
Updated
•
6
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft
Text Generation
•
Updated
•
5
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt
Text Generation
•
Updated
•
50
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft
Updated
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf
Text Generation
•
Updated
•
8
•
1
DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf
Text Generation
•
Updated
•
99
datasets
12
DongfuJiang/PRM_SFT
Viewer
•
Updated
•
4.01M
•
40
DongfuJiang/zeroeval
Viewer
•
Updated
•
13.5k
•
52
DongfuJiang/PRM_eval
Viewer
•
Updated
•
9.54k
•
40
DongfuJiang/eval
Viewer
•
Updated
•
6k
•
45
DongfuJiang/PRM_prepared
Viewer
•
Updated
•
39.9k
•
42
DongfuJiang/PRM_train
Viewer
•
Updated
•
32.7k
•
40
DongfuJiang/MATH-500
Viewer
•
Updated
•
500
•
94
DongfuJiang/simpo_v2_ultrafeedback
Viewer
•
Updated
•
59.9k
•
33
DongfuJiang/VAPO
Viewer
•
Updated
•
72.5k
•
33
DongfuJiang/PairRM-data
Viewer
•
Updated
•
586k
•
33