Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fla-hub
/
gsa-1.3B-100B
like
0
Follow
fla-hub
43
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
main
gsa-1.3B-100B
Commit History
Upload GSAForCausalLM
1e4ffda
verified
yzhangcs
commited on
Feb 9
Remove the `norm_first` option
7c18483
yzhangcs
commited on
Feb 6
Update README.md
39d60a1
verified
yzhangcs
commited on
Sep 30, 2024
Update README.md
781baa4
verified
yzhangcs
commited on
Sep 30, 2024
Link model to paper (
#1
)
63d0c7d
verified
yzhangcs
nielsr
HF staff
commited on
Sep 22, 2024
Update tokenizer_config.json
2c8b93b
verified
yzhangcs
commited on
Sep 2, 2024
Upload GSAForCausalLM
023f4e2
verified
yzhangcs
commited on
Jun 7, 2024
initial commit
f060f9a
verified
yzhangcs
commited on
Jun 7, 2024