Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
mamba2-hybrid-8b-3t-128k
like
42
Follow
NVIDIA
7.55k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
Community
4
main
mamba2-hybrid-8b-3t-128k
1 contributor
History:
2 commits
rwaleffe
Upload model
9083d98
7 months ago
release
Upload model
7 months ago
.gitattributes
Safe
1.52 kB
initial commit
7 months ago
README.md
Safe
2.23 kB
Upload model
7 months ago
latest_checkpointed_iteration.txt
Safe
8 Bytes
Upload model
7 months ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model
Safe
4.57 MB
LFS
Upload model
7 months ago