Model Card for ViTMix-v1

This model is a poorly functional demo to using MOEs in computer vision

Model Details

Model Description

This Model is mean't to serve more as a blueprint than a base. It has been trained of fashionmnist to prove that I can do tensor maths. It achieves an average loss of 0.4-ish.

The code is in files. Do what you want!

Downloads last month
1
Safetensors
Model size
391M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train SE6446/VitMix-v1