Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
akswelh
/
NEOX
like
0
arxiv:
29 papers
Model card
Files
Files and versions
Community
main
NEOX
/
megatron
1 contributor
History:
1 commit
akswelh
Upload 251 files
d90b3a8
verified
2 months ago
data
Upload 251 files
2 months ago
fused_kernels
Upload 251 files
2 months ago
gradient_noise_scale
Upload 251 files
2 months ago
model
Upload 251 files
2 months ago
mpu
Upload 251 files
2 months ago
neox_arguments
Upload 251 files
2 months ago
tokenizer
Upload 251 files
2 months ago
__init__.py
929 Bytes
Upload 251 files
2 months ago
checkpointing.py
17.6 kB
Upload 251 files
2 months ago
devutil.py
1.28 kB
Upload 251 files
2 months ago
initialize.py
8.58 kB
Upload 251 files
2 months ago
learning_rates.py
5.22 kB
Upload 251 files
2 months ago
logging.py
16.4 kB
Upload 251 files
2 months ago
mup_substitute.py
7.8 kB
Upload 251 files
2 months ago
optimizers.py
18.1 kB
Upload 251 files
2 months ago
text_generation_utils.py
42.3 kB
Upload 251 files
2 months ago
training.py
64.4 kB
Upload 251 files
2 months ago
utils.py
17.6 kB
Upload 251 files
2 months ago