Yasir Kayaalp
yasirkayaalp
0 followers · 1 following
yasir-kayaalp
AI & ML interests
None yet
Recent Activity
liked a Space about 2 months ago: Kwai-Kolors/Kolors-Character-With-Flux
liked a Space about 2 months ago: ginipick/FLUXllama
reacted to thomwolf's post about 2 months ago
We are proud to announce https://huggingface.co/datasets/HuggingFaceFW/fineweb-2: a sparkling update to https://huggingface.co/datasets/HuggingFaceFW/fineweb with 1000s of languages. We applied the same data-driven approach that led to SOTA English performance in FineWeb to thousands of languages. FineWeb2 has 8TB of compressed text data and outperforms other multilingual datasets in our experiments. The dataset is released under the permissive ODC-By 1.0 license, and the code to reproduce it and our evaluations is public. We will very soon announce a big community project, and are working on a blog post walking you through the entire dataset creation process. Stay tuned! In the meantime, come ask us questions in our chat space: https://huggingface.co/spaces/HuggingFaceFW/discussion H/t @guipenedo @hynky @lvwerra as well as @vsabolcec Bettina Messmer @negar-foroutan and @mjaggi
Organizations
None yet
spaces
2
Sleeping
Csaac
adsvvdsa
Sleeping
Test2
test2
models
2
yasirkayaalp/hukukAi
Text Classification • Updated Nov 30, 2024 • 107
yasirkayaalp/test
Updated Nov 29, 2024
datasets
None public yet