inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic 20B • Updated 8 days ago • 14
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid 20B • Updated 8 days ago • 19
inference-optimization/gpt-oss-120b-from-self-ckpt5-speculator.eagle3 0.9B • Updated 10 days ago • 71
inference-optimization/gpt-oss-120b-from-self-ckpt3-speculator.eagle3 0.9B • Updated 10 days ago • 60
inference-optimization/gpt-oss-120b-from-self-ckpt4-speculator.eagle3 0.9B • Updated 10 days ago • 55
inference-optimization/gpt-oss-120b-from-self-ckpt2-speculator.eagle3 0.9B • Updated 10 days ago • 64
inference-optimization/gpt-oss-120b-from-self-ckpt1-speculator.eagle3 0.9B • Updated 10 days ago • 60
inference-optimization/gpt-oss-120b-from-self-ckpt0-speculator.eagle3 0.9B • Updated 10 days ago • 61
inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned 81B • Updated 10 days ago • 53
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt3 0.5B • Updated 11 days ago • 24
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt3-speculator.eagle3 0.9B • Updated 11 days ago • 41
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt1-speculator.eagle3 0.9B • Updated 11 days ago • 26
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt0-speculator.eagle3 0.9B • Updated 11 days ago • 25
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt2 0.5B • Updated 11 days ago • 23
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt1 0.5B • Updated 11 days ago • 20