Transformers
PyTorch
bert

Model details

minimoe-4L-384H distilled from bert-large-uncased on Wikipedia.

repository: https://github.com/GeneZC/MiniMoE arXiv: https://arxiv.org/abs/2305.12129

Downloads last month
5
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train GeneZC/bert-large-minimoe-4L-384H