kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2 Text Generation • Updated Dec 7, 2024
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1 Text Generation • Updated Dec 7, 2024
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_32B_tokenized Viewer • Updated 4 days ago • 49.4k • 111
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_1.5B_tokenized Viewer • Updated 7 days ago • 49.4k • 98
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized Viewer • Updated 14 days ago • 49.5k • 119