Submitted by
Omer Faruk Akgul
University of Southern California
university
Verified
AI & ML interests
None defined yet.
Recent Activity
Papers
Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning
Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?