2 125 28

Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 3 days ago

LIMO: Less is More for Reasoning

upvoted a paper 3 days ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

upvoted a paper 3 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

View all activity

Organizations

None yet

passing2961's activity

upvoted 3 papers 3 days ago

upvoted 3 papers 4 days ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published 6 days ago • 14

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 6 days ago • 34

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 5 days ago • 46

upvoted 2 papers 6 days ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published 10 days ago • 8

s1: Simple test-time scaling

Paper • 2501.19393 • Published 9 days ago • 94

upvoted a paper 8 days ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 10 days ago • 17

liked a model 9 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 8 hours ago • 2.43M • • 7.85k

upvoted 3 papers 16 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published 18 days ago • 55

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 18 days ago • 79

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 18 days ago • 305

upvoted 2 papers 20 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published 23 days ago • 43

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 24 days ago • 105

upvoted 2 papers 24 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 26 days ago • 273

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 26 days ago • 53

upvoted a collection 24 days ago

SOTOPIA

Collection

8 items • Updated May 16, 2024 • 6

upvoted 2 papers 26 days ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 27 days ago • 89