deepseek-ai/deepseek-moe-16b-base
Tags: Text Generation, Transformers, Safetensors, deepseek, custom_code
arxiv: 2401.06066
License: deepseek
3 contributors
History: 3 commits
Latest commit: 487c5e7 "update readme" by zwd973-deepseek, 9 months ago
File                        Size      Last commit      Updated
.gitattributes              1.52 kB   initial commit   9 months ago
README.md                   1.93 kB   update readme    9 months ago
configuration_deepseek.py   10.2 kB   initial commit   9 months ago
modeling_deepseek.py        72.7 kB   initial commit   9 months ago