Mini-Merge-35B-A3B

This is a model merge, not an original training run. It is a plain element-wise weighted average of two existing open-weight checkpoints, produced by a roughly 40-line safetensors script rather than mergekit, whose architecture catalog does not cover this model. We did no pretraining and no post-training of our own. The exact recipe ships as merge.yaml next to the weights.

Ingredients

  • nex-agi/Nex-N2-mini
  • Qwen/Qwen3.6-35B-A3B

Merge variant: mini-b0.2. Full pipeline and commands: https://github.com/PixelML/merge-theater
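
The element-wise weighted average described at the top of this card can be sketched in a few lines. This is a minimal illustration, not the shipped script: the function name merge_state_dicts, the parameter weight_b, and the toy tensors are all hypothetical, and reading "b0.2" as a weight of 0.2 on the second checkpoint is an assumption. merge.yaml remains the authoritative recipe. NumPy arrays stand in for tensors that a real run would load from safetensors files.

```python
import numpy as np


def merge_state_dicts(state_a, state_b, weight_b):
    """Return (1 - weight_b) * A + weight_b * B for every tensor (hypothetical helper)."""
    if state_a.keys() != state_b.keys():
        raise ValueError("checkpoints must expose identical tensor names")
    merged = {}
    for name, tensor_a in state_a.items():
        tensor_b = state_b[name]
        if tensor_a.shape != tensor_b.shape:
            raise ValueError(f"shape mismatch for tensor {name!r}")
        # Mix in float32 so low-precision inputs are not rounded mid-sum.
        mixed = (1.0 - weight_b) * tensor_a.astype(np.float32) \
            + weight_b * tensor_b.astype(np.float32)
        # Cast back to the dtype of the first checkpoint.
        merged[name] = mixed.astype(tensor_a.dtype)
    return merged


# Toy stand-ins for the two parent checkpoints (assumed values, not real weights).
parent_a = {"layer.weight": np.array([1.0, 2.0], dtype=np.float32)}
parent_b = {"layer.weight": np.array([3.0, 6.0], dtype=np.float32)}

# Assumption: weight_b=0.2 mirrors the "b0.2" in the variant name.
merged = merge_state_dicts(parent_a, parent_b, weight_b=0.2)
```

A plain average like this only works when both checkpoints share tensor names and shapes, which is why the sketch checks both before mixing.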

Benchmarks (same lm-eval tasks, parents vs merge)

task           mini-b0.2  Nex-N2-mini  Qwen3.6-35B-A3B
arc_challenge  62.4       62.5         54.6
gsm8k          90.1       89.2         29.3
hellaswag      84.0       83.5         83.5
humaneval      66.5       72.0         61.0
mbpp           63.8       54.6         66.2
winogrande     73.3       73.6         73.7

Aggregate (mean over shared tasks):

  • mini-b0.2: 73.36
  • Nex-N2-mini: 72.59
  • Qwen3.6-35B-A3B: 61.37
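
The aggregate is an unweighted mean over the six tasks, which the sketch below recomputes from the table. The scores are copied from the table above; the variable names are illustrative. Because the table is rounded to one decimal place, the recomputed means match the listed aggregates only to within about 0.03, presumably because the listed figures were computed from unrounded scores (an assumption, since the card does not say).

```python
# Per-task scores copied from the benchmark table, in the order:
# arc_challenge, gsm8k, hellaswag, humaneval, mbpp, winogrande.
scores = {
    "mini-b0.2": [62.4, 90.1, 84.0, 66.5, 63.8, 73.3],
    "Nex-N2-mini": [62.5, 89.2, 83.5, 72.0, 54.6, 73.6],
    "Qwen3.6-35B-A3B": [54.6, 29.3, 83.5, 61.0, 66.2, 73.7],
}

# Unweighted mean over the shared tasks for each model.
means = {model: sum(values) / len(values) for model, values in scores.items()}

for model, mean in means.items():
    print(f"{model}: {mean:.2f}")
```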

A merge that tops a parent by a point or two mostly reflects eval-format luck, not new capability, so read the results that way. The numbers reproduce with the command below.

Reproduce

modal run modal_app.py::sweep --weights-b 0.2,0.3,0.4

License

Both parents (nex-agi/Nex-N2-mini and Qwen/Qwen3.6-35B-A3B) are licensed under Apache-2.0, so this merge is released under Apache-2.0 as well. The merge was produced with a small original script rather than mergekit, so no LGPL terms apply. This card must travel with the weights.

Credit

Built by Claude Code (Anthropic's agent) running autonomously: the merge, the eval pipeline, the ratio sweep, and this card.
