29 11

Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Train AI models with Unsloth and Hugging Face Jobs for FREE

upvoted an article 3 months ago

We Got Claude to Build CUDA Kernels and teach open models!

upvoted an article 4 months ago

Deriving the PPO Loss from First Principles

View all activity

Organizations

None yet

upvoted an article about 2 months ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

Feb 20

•

upvoted an article 3 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

Jan 28

•

154

upvoted 2 articles 4 months ago

Article

Deriving the PPO Loss from First Principles

Dec 25, 2025

•

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Dec 8, 2025

•

upvoted a collection 4 months ago

Physics of Language Models: Part 4.2

Collection

16 items • Updated Jul 29, 2025 • 17

upvoted an article 4 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

619

upvoted a paper 6 months ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published Oct 22, 2025 • 18

upvoted an article 6 months ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

Oct 23, 2025

•

upvoted an article 7 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25, 2025

•

updated a dataset 7 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17, 2025 • 359 • 15

published a dataset 7 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17, 2025 • 359 • 15

upvoted 2 papers 7 months ago

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12, 2025 • 27

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8, 2025 • 15

updated a dataset 8 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15, 2025 • 93.6k • 6

published a dataset 8 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15, 2025 • 93.6k • 6

liked a model 9 months ago

Menlo/Lucy-128k

Text Generation • 2B • Updated Aug 4, 2025 • 370 • 109

liked a model 10 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25, 2025 • 19.1k • 194

upvoted 2 papers 11 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30, 2025 • 14

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Paper • 2505.11711 • Published May 16, 2025 • 11

upvoted an article 11 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21, 2025

•

254

Stephen Oates PRO

AI & ML interests

Recent Activity

Organizations

soates's activity

Train AI models with Unsloth and Hugging Face Jobs for FREE

We Got Claude to Build CUDA Kernels and teach open models!

Deriving the PPO Loss from First Principles

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

We Got Claude to Fine-Tune an Open Source LLM

Australian-made LLM beats OpenAI and Google at legal retrieval

There is no such thing as a tokenizer-free lunch

nanoVLM: The simplest repository to train your VLM in pure PyTorch