Jiahang Xu

Jiahang

JiahangXu

Jiahang's activity

upvoted 2 papers about 2 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 84

LongRoPE2: Near-Lossless LLM Context Window Scaling

Paper • 2502.20082 • Published Feb 27 • 38

published a model about 2 months ago

Jiahang/Qwen2.5-1.5B-Open-R1-Distill

Updated Feb 24

upvoted a paper about 2 months ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48

upvoted a paper 2 months ago

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published Feb 6 • 13

commented on a paper 2 months ago

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published Feb 6 • 13

upvoted 2 papers 3 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 277

upvoted 3 papers 8 months ago

authored 4 papers 8 months ago

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference

Paper • 2303.08308 • Published Mar 15, 2023 • 1

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices

Paper • 2303.09730 • Published Mar 17, 2023 • 1

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models

Paper • 2310.05015 • Published Oct 8, 2023 • 1

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 73

upvoted a paper 8 months ago

Language Models as Black-Box Optimizers for Vision-Language Models

Paper • 2309.05950 • Published Sep 12, 2023 • 4

authored a paper 12 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259

upvoted a collection about 1 year ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 748

liked a Space about 1 year ago

13k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

authored a paper about 1 year ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116