arxiv:2408.11857
Jeffrey Quesnelle PRO
emozilla
AI & ML interests
None yet
Organizations
Papers
2
models
67
emozilla/llama2-150m-init-nanotron
emozilla/llama2-150m-init • Text Generation • 5
emozilla/llama2-20m-init-nanotron
emozilla/llama2-1.2b-init-v2 • Text Generation • 49
emozilla/llama2-20m-init • Text Generation • 9
emozilla/llama2-1.2b-init-nanotron
emozilla/llama2-20m-crelu-init • Text Generation • 5
emozilla/llama2-1.2b-init • Text Generation • 15
emozilla/llama3-1.6b-init • Text Generation • 9
emozilla/llama3-1.3b-gptneox-init • Text Generation • 10
datasets
50
emozilla/dolma-v1_7-30B-tokenized-llama2-nanoset • 1
emozilla/fineweb-10bt-tokenized-datatrove-llama2 • 1 • 1
emozilla/fineweb-350bt-tokenized-datatrove-llama2 • 1
emozilla/dolma-v1_7-305B-tokenized-llama2-nanoset • 2
emozilla/proofpile-test-tokenized-llama3 • Viewer • 46.3k • 2
emozilla/PaulGrahamEssays • Viewer • 49 • 2
emozilla/dolma-v1_7-cc_en_head • Viewer • 475M • 2
emozilla/dolma-v1_7-c4 • Viewer • 250M • 2 • 1
emozilla/dolma-v1_7-305B-tokenized-llama3-nanoset • 2
emozilla/dolma-v1_7-books • Viewer • 56k • 271 • 1