Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
datablations
https://github.com/huggingface/datablations
Activity Feed
Request to join this org
Follow
15
AI & ML interests
Scaling Data-Constrained Language Models
Recent Activity
teven
authored
a paper
3 months ago
Pixtral 12B
Muennighoff
authored
a paper
3 months ago
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
srush
authored
a paper
3 months ago
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
View all activity
Team members
9
models
38
Sort: Recently updated
datablations/lm1-2b8-55b-oscartasky
Updated
Jun 24, 2023
datablations/lm1-2b8-55b-tasky
Updated
Jun 13, 2023
datablations/lm1-8b7-178b-c4-repetitions
Updated
May 30, 2023
datablations/lm1-8b7-178b-oscar-repetitions
Updated
May 30, 2023
•
1
datablations/lm1-misc
Updated
May 30, 2023
datablations/lm1-4b2-84b-c4-repetitions
Updated
May 30, 2023
datablations/lm1-2b8-55b-c4-perplexity
Updated
May 26, 2023
datablations/lm1-misc-pile
Updated
May 25, 2023
datablations/lm1-2b8-55b-c4-repetitions
Updated
May 20, 2023
datablations/lm1-misc-oscar
Updated
May 20, 2023
Expand 38 models
datasets
13
Sort: Recently updated
datablations/scripts
Viewer
•
Updated
Jun 15, 2023
•
3.48M
•
565
datablations/oscar-subsets
Viewer
•
Updated
Jun 14, 2023
•
365k
•
349
datablations/c4-subsets
Viewer
•
Updated
Jun 14, 2023
•
729k
•
442
•
2
datablations/c4-filter-megatron
Updated
May 28, 2023
•
85
datablations/oscar-filter-megatron
Updated
May 27, 2023
•
50
datablations/python-megatron
Updated
May 22, 2023
•
252
•
1
datablations/subsets
Viewer
•
Updated
May 10, 2023
•
365k
•
21
datablations/oscar-filter
Viewer
•
Updated
May 10, 2023
•
432M
•
2.23k
datablations/oscar-dedup-expanded
Viewer
•
Updated
May 10, 2023
•
432M
•
1.48k
datablations/mup
Updated
Apr 24, 2023
•
116
Expand 13 datasets