Junxian He's picture

23 52

Junxian He

jxhe

·

https://jxhe.github.io

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

MiniMaxAI/MiniMax-M2.5

upvoted a paper 10 days ago

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

upvoted a paper 14 days ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

View all activity

Organizations

upvoted a paper 10 days ago

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

Paper • 2602.07962 • Published 12 days ago • 24

upvoted a paper 14 days ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 15 days ago • 28

upvoted a paper 4 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

upvoted a paper 5 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 82

upvoted a paper 6 months ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published Aug 28, 2025 • 8

upvoted a collection 8 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 7 days ago • 119

upvoted 2 papers 9 months ago

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published May 28, 2025 • 6

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21, 2025 • 34

upvoted a collection 11 months ago

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 13 items • Updated May 5, 2025 • 8

upvoted a paper 11 months ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24, 2025 • 31

upvoted a paper 12 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2, 2025 • 56

upvoted a collection about 1 year ago

SimpleRL

The collection for the Project "Simple Reinforcement Learning for Reasoning" • 2 items • Updated Feb 19, 2025 • 7

upvoted 3 papers about 1 year ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11, 2025 • 50

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 42

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 47

upvoted a paper almost 2 years ago

Compression Represents Intelligence Linearly

Paper • 2404.09937 • Published Apr 15, 2024 • 28

upvoted a collection almost 2 years ago

Zephyr 7B Gemma

Models, dataset, and Demo for Zephyr 7B Gemma. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 5 items • Updated Apr 12, 2024 • 15

upvoted 2 papers about 2 years ago

AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 54

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

upvoted a paper over 2 years ago

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Paper • 2307.13269 • Published Jul 25, 2023 • 34