Hsu Shihyueh
AIR-hl
AI & ML interests
Nothing
Recent Activity
updated
a model
4 days ago
AIR-hl/DeepSeek-R1-Distill-Qwen-7B-AIMO
published
a model
4 days ago
AIR-hl/DeepSeek-R1-Distill-Qwen-7B-AIMO
updated
a dataset
5 days ago
AIR-hl/OpenR1-OpenThoughts-SFT-math
Organizations
None yet
Collections
2
models
10

AIR-hl/DeepSeek-R1-Distill-Qwen-7B-AIMO
Updated
•
10

AIR-hl/Mistral-7B-Base-WPO-bf16
Text Generation
•
Updated
•
17

AIR-hl/Llama-3.2-3B-WPO
Text Generation
•
Updated
•
14

AIR-hl/Llama-3.2-3B-DPO
Text Generation
•
Updated
•
20
•
2

AIR-hl/Qwen2.5-1.5B-SimPO
Text Generation
•
Updated
•
35

AIR-hl/Qwen2.5-1.5B-WPO
Text Generation
•
Updated
•
32

AIR-hl/Qwen2.5-1.5B-DPO
Text Generation
•
Updated
•
24

AIR-hl/Llama-3.2-1B-DPO
Text Generation
•
Updated
•
46

AIR-hl/Llama-3.2-1B-ultrachat200k
Text Generation
•
Updated
•
42

AIR-hl/Qwen2.5-1.5B-ultrachat200k
Text Generation
•
Updated
•
55