Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
l3lab
's Collections
L1
miniCTX
L1
updated
Mar 7
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Upvote
5
l3lab/L1-Qwen-1.5B-Max
Updated
Mar 7
•
37.7k
•
14
l3lab/L1-Qwen-1.5B-Exact
Updated
18 days ago
•
8.36k
•
4
Upvote
5
+1
Share collection
View history
Collection guide
Browse collections