HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_560_steps 2B • Updated 2 days ago • 12
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_400_steps 2B • Updated 2 days ago • 14
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_seq_sft_16k_1024_toks_1500_steps 2B • Updated 5 days ago • 6
HerrHruby/reasoning_cache_deepscalr_16k_sft_e2e_summaries_2048_38k Viewer • Updated about 14 hours ago • 38.8k
HerrHruby/reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_1024_2048_73k Viewer • Updated 1 day ago • 73.1k • 7
HerrHruby/reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_2048_small Viewer • Updated 1 day ago • 18.2k • 12