HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_38k_2048_toks_2100_steps 2B • Updated about 14 hours ago • 7
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_560_steps 2B • Updated 3 days ago • 12
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_400_steps 2B • Updated 3 days ago • 14
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_seq_sft_16k_1024_toks_1500_steps 2B • Updated 6 days ago • 12