Tianze Wang's picture

Tianze Wang

tzwilliam0

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

tzwilliam0/qwen-dapo-17k-vs

published a model 2 days ago

tzwilliam0/qwen-dapo-17k-vs

updated a model 2 days ago

tzwilliam0/qwen-dapo-17k-vr

View all activity

Organizations

None yet

models 55

tzwilliam0/qwen-dapo-17k-vs

Text Generation • 4B • Updated 2 days ago • 548

tzwilliam0/qwen-dapo-17k-vr

Text Generation • 4B • Updated 2 days ago • 83

tzwilliam0/merged_baseline_2.34

Updated Nov 26, 2025

tzwilliam0/merged_baseline_1.66

Updated Nov 26, 2025

tzwilliam0/merged_baseline_1.23

Updated Nov 26, 2025

tzwilliam0/merged_baseline_0.93

Updated Nov 26, 2025

tzwilliam0/merged_baseline_0.68

Updated Nov 26, 2025

tzwilliam0/Math_Instruct_merged_2.34

Updated Nov 25, 2025 • 1

tzwilliam0/Math_Instruct_merged_1.66

Updated Nov 25, 2025

tzwilliam0/Math_Instruct_merged_1.23

Updated Nov 25, 2025

datasets 19

tzwilliam0/semi_Ministral3_8B

Viewer • Updated Mar 14 • 4.96k • 7

tzwilliam0/semi_qwen2.5_0.5B

Viewer • Updated Mar 14 • 4.96k • 8

tzwilliam0/instruction_following

Viewer • Updated Oct 21, 2025 • 19.9k • 4

tzwilliam0/instruction_following_dpo_filtered_add

Viewer • Updated Oct 21, 2025 • 18.8k • 4

tzwilliam0/instruction_following_dpo_filtered

Viewer • Updated Oct 20, 2025 • 10.3k • 4

tzwilliam0/math_reward_training

Viewer • Updated Oct 17, 2025 • 2.42k • 4

tzwilliam0/non_reasoning_reward_training

Viewer • Updated Oct 16, 2025 • 30k • 5

tzwilliam0/non_reasoning_training

Viewer • Updated Oct 16, 2025 • 30k • 5

tzwilliam0/Safe_dpo_helpful

Viewer • Updated Jul 31, 2025 • 30.4k • 5

tzwilliam0/Safe_dpo_harmless

Viewer • Updated Jul 31, 2025 • 30.4k • 4

View 19 datasets