
Ken Tsui

kenhktsui

https://kenhktsui.github.io/

AI & ML interests

ML engineer, researcher. VLM, LLM benchmark. Opinions are my own.

Recent Activity

liked a dataset about 1 month ago

Hothan/OlympiadBench

liked a dataset about 2 months ago

mixture-vitae-backup/MixtureVitae-2TT

upvoted a paper 3 months ago

Diffusion Transformers with Representation Autoencoders


Organizations

upvoted 5 papers 3 months ago

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 537

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 8

upvoted 3 papers 6 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30, 2025 • 14

upvoted an article 7 months ago

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Article • Mar 11, 2025 • 103

upvoted 2 articles 8 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 751

Vision Language Models (Better, faster, stronger)

Article • May 12, 2025 • 578

upvoted a collection 9 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 677

upvoted a paper 9 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 41

upvoted an article 9 months ago

Breaking resolution curse of vision-language models

Article • Feb 24, 2024

upvoted an article 11 months ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4, 2025 • 1.31k

upvoted a paper 11 months ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3, 2025 • 19

upvoted a paper 12 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

upvoted 2 papers about 1 year ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 85

upvoted an article over 1 year ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Jul 11, 2024 • 124
