Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
81
10
15
Guilherme Penedo
guipenedo
Follow
BeNotAfraid777's profile picture
madhukarreddyk's profile picture
theboy32's profile picture
708 followers
·
6 following
gui_penedo
guipenedo
AI & ML interests
None yet
Articles
FineWeb2-C: Help Build Better Language Models in Your Language
2 days ago
•
10
Organizations
guipenedo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
17 days ago
HuggingFaceFW/fineweb-2
Viewer
•
Updated
17 days ago
•
13.8B
•
86.4k
•
361
liked
a Space
29 days ago
Running
32
💬
Discussion Forum
liked
a model
about 2 months ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
21 days ago
•
94.1k
•
442
liked
a Space
2 months ago
Running
47
📝
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
liked
a Space
3 months ago
Running
93
📖
TxT360: Trillion Extracted Text
liked
a model
3 months ago
cis-lmu/glotlid
Text Classification
•
Updated
Oct 26
•
8.14k
•
51
liked
a dataset
3 months ago
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
34.7k
•
825
liked
a Space
5 months ago
Running
362
🧽
Finegrain Object Eraser
Erase any object just by naming it!
liked
2 models
5 months ago
HuggingFaceTB/SmolLM-1.7B
Text Generation
•
Updated
Oct 16
•
10.2k
•
164
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
Updated
Aug 18
•
44.8k
•
107
liked
a model
6 months ago
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14
•
2.85k
•
321
liked
a dataset
7 months ago
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
5 days ago
•
3B
•
329k
•
571
liked
a Space
7 months ago
Running
547
🍷
FineWeb: decanting the web for the finest text data at scale
liked
a dataset
8 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
7 days ago
•
48.4B
•
245k
•
1.79k
liked
a Space
about 1 year ago
Running
206
🚀
GPT Baker