6 22

Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, ML4Science

Recent Activity

liked a dataset about 1 month ago

ILSVRC/imagenet-1k

liked a dataset 8 months ago

LEAP/ClimSim_high-res

upvoted an article 8 months ago

Finally, a Replacement for BERT: Introducing ModernBERT

View all activity

Organizations

None yet

liked a dataset about 1 month ago

ILSVRC/imagenet-1k

Viewer • Updated Sep 17, 2025 • 1.43M • 77.3k • 742

liked a dataset 8 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 70.2k • 12

upvoted an article 8 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

728

liked a dataset 9 months ago

mcherukara/PtychoNN_data

Updated Mar 18, 2025 • 135 • 2

liked a model 10 months ago

allenai/ACE2-ERA5

Updated Nov 18, 2025 • 65 • 15

liked a model 11 months ago

microsoft/aurora

Updated Jun 20, 2025 • 50

upvoted an article 11 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

•

liked a Space 12 months ago

Memory Viz

🧠

Memory Viz

liked 2 Spaces about 1 year ago

Predict Memory

🧮

106

Calculate and visualize model memory usage from config

The Ultra-Scale Playbook

🌌

3.72k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 1 year ago

Article

Open-R1: Update #1

Feb 2, 2025

•

305

liked 2 datasets about 1 year ago

PleIAs/common_corpus

Viewer • Updated 13 days ago • 69.9k • 93.1k • 382

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 228k • 967

liked 3 models about 1 year ago

upvoted a collection about 1 year ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 158

liked a model about 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 1.02M • 1k

liked 2 Spaces about 1 year ago

TheWell

🌍

Visualization of data from the Well

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Generate a curated web‑text dataset for LLM training

Thomas Bouvier

AI & ML interests

Recent Activity

Organizations

tbouvier's activity

Finally, a Replacement for BERT: Introducing ModernBERT

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Memory Viz

Predict Memory

The Ultra-Scale Playbook

Open-R1: Update #1

TheWell

FineWeb: decanting the web for the finest text data at scale