Dokyoon

leeloolee

Eruly

Recent Activity

published a dataset 3 days ago

sionic-ai/reasoning

published a dataset 3 days ago

sionic-ai/reasoning-0.01-ko

liked a Space 5 days ago

nanotron/ultrascale-playbook

leeloolee's activity

upvoted a paper 14 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 17 days ago • 46

upvoted a paper 24 days ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 27 days ago • 40

upvoted a paper about 1 month ago

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19 • 25

upvoted 2 papers about 2 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 114

Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach

Paper • 2502.03639 • Published Feb 5 • 9

upvoted a paper 2 months ago

DiffuEraser: A Diffusion Model for Video Inpainting

Paper • 2501.10018 • Published Jan 17 • 14

upvoted a collection 3 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated 24 days ago • 97

upvoted 2 papers 3 months ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published Jan 6 • 14

GUI Agents: A Survey

Paper • 2412.13501 • Published Dec 18, 2024 • 28

upvoted 2 papers 4 months ago

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 44

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted 2 collections 4 months ago

Multimodal-SAE

Collection

A collection of SAEs hooked onto LLaVA • 5 items • Updated Mar 4 • 8

GUI agents

Collection

A collection of papers on GUI agents • 3 items • Updated Dec 14, 2024 • 5

upvoted 3 papers 4 months ago

upvoted a paper 5 months ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45