- Expanding RL with Verifiable Rewards Across Diverse Domains — arXiv:2503.23829 (Mar 2025)
- What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models — arXiv:2503.24235 (Mar 2025)
- Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model — arXiv:2503.24290 (Mar 2025)
- NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets (article)
- Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding — arXiv:2411.04282 (Nov 6, 2024)
- Scaling Retrieval-Based Language Models with a Trillion-Token Datastore — arXiv:2407.12854 (Jul 9, 2024)
- Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies — arXiv:2407.13623 (Jul 18, 2024)
- DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation — arXiv:2406.16855 (Jun 24, 2024)
- MotionLLM: Understanding Human Behaviors from Human Motions and Videos — arXiv:2405.20340 (May 30, 2024)
- Xwin-LM: Strong and Scalable Alignment Practice for LLMs — arXiv:2405.20335 (May 30, 2024)
- Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length — arXiv:2404.08801 (Apr 12, 2024)
- MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies — arXiv:2404.06395 (Apr 9, 2024)
- Mixture-of-Depths: Dynamically allocating compute in transformer-based language models — arXiv:2404.02258 (Apr 2, 2024)
- Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking — arXiv:2403.09629 (Mar 14, 2024)
- RewardBench: Evaluating Reward Models for Language Modeling — arXiv:2403.13787 (Mar 20, 2024)