🔄 In a Training Loop

Asankhaya Sharma

codelion

86 142 372

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and Ellora. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

liked a model about 21 hours ago

ddark-il/granite-4.1-8b-optiq

liked a model about 21 hours ago

Joni-121/gemma-3-1b-it-reasoning-grpo-lora-F16-GGUF

liked a dataset 1 day ago

codelion/synth-1B


Organizations

upvoted a paper 6 days ago

Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks

Paper • 2606.29082 • Published 10 days ago • 37

upvoted an article 11 days ago

SPROG-9M: how far a 9-million-parameter, LLM-free model gets on grade-school math

Article • codelion • 11 days ago • 1

upvoted a paper 13 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 14 days ago • 145

upvoted 3 papers 20 days ago

Even with AI, Bijection Discovery is Still Hard: The Opportunities and Challenges of OpenEvolve for Novel Bijection Construction

Paper • 2511.20987 • Published Nov 26, 2025 • 1

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 122

EvoX: Meta-Evolution for Automated Discovery

Paper • 2602.23413 • Published Feb 26 • 2

upvoted a paper about 1 month ago

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published Jun 2 • 67

upvoted a paper about 2 months ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 116

upvoted a paper 2 months ago

Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper • 2604.27085 • Published Apr 29 • 47

upvoted 2 collections 3 months ago

YOLO 26

Collection • 5 items • Updated Apr 11 • 2

Nemotron-Post-Training-v3

Collection • Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 25 days ago • 169

upvoted a changelog 3 months ago

Agent Traces on the Hub

Hugging Face Changelog • Apr 7 • 149

upvoted an article 3 months ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Article • merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 911

upvoted a collection 4 months ago

Nano Language Models

Collection • A collection of really small language models pre-trained from scratch with open data. Ideal for use in experimentation and evaluations. • 3 items • Updated Mar 25 • 1

upvoted an article 4 months ago

Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens

Article • codelion • Mar 6 • 5

upvoted a collection 4 months ago

🤏 Smol-Data

Collection • Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 13

upvoted a paper 5 months ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 229

upvoted an article 5 months ago

Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models

Article • codelion • Jan 23 • 10

upvoted an article 6 months ago

The Optimal Architecture for Small Language Models

Article • codelion • Dec 26, 2025 • 121

upvoted a paper 7 months ago

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 44
