arxiv:2412.13670
Mingzhe Du
Elfsong
AI & ML interests
Code Generation / Preference Alignment / Bias Mitigation
Recent Activity
updated
a dataset
2 days ago
Elfsong/apps_generation
updated
a dataset
2 days ago
Elfsong/apps_generation
updated
a dataset
2 days ago
Elfsong/apps_generation
Organizations
Papers
2
spaces
5
models
17
Elfsong/Phi-4-14B-Instruct-sft
Text Generation
•
Updated
•
2
Elfsong/Llama-3.1-8B-Instruct-sft
Text Generation
•
Updated
•
244
•
1
Elfsong/Phi-3.5-4B-instruct-sft
Text Generation
•
Updated
•
5
•
1
Elfsong/Llama-3.3-70B-Instruct-dpo
Text Generation
•
Updated
•
15
Elfsong/Llama-3.3-70B-Instruct-stf
Text Generation
•
Updated
•
67
Elfsong/Llama-3.1-8B-Instruct-dpo
Text Generation
•
Updated
•
40
Elfsong/mouadsfilter
Text2Text Generation
•
Updated
•
4
Elfsong/dpo
Updated
Elfsong/debias_model
Updated
Elfsong/my_awesome_model
Updated
datasets
67
Elfsong/apps_generation
Viewer
•
Updated
•
2.25k
•
224
Elfsong/Venus_t
Viewer
•
Updated
•
2.08k
•
635
Elfsong/Venus_KTO
Viewer
•
Updated
•
631k
•
24
Elfsong/Venus_SFT
Viewer
•
Updated
•
276k
•
35
Elfsong/Venus_DPO
Viewer
•
Updated
•
127k
•
19
Elfsong/Llama-3.3-70B-Instruct-sft-response
Viewer
•
Updated
•
256
•
31
Elfsong/Llama-3.3-70B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
29
Elfsong/Llama-3.3-70B-Instruct-response
Viewer
•
Updated
•
256
•
31
Elfsong/Llama-3.1-8B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
30
Elfsong/Llama-3.1-8B-Instruct-response
Viewer
•
Updated
•
256
•
31