
Rui Pan

ruipeterpan · https://ruipan.xyz/

AI & ML interests

Systems and algorithms for efficient LLM inference

Organizations

None yet

authored 3 papers 2 months ago

Mowgli: Passively Learned Rate Control for Real-Time Video

Paper • 2410.03339 • Published Oct 4, 2024 • 1

Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs

Paper • 2512.20573 • Published Dec 23, 2025 • 1

Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning

Paper • 2210.00093 • Published Sep 30, 2022

authored 2 papers 11 months ago

SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning

Paper • 2504.07891 • Published Apr 10, 2025 • 5

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

Paper • 2312.05385 • Published Dec 8, 2023 • 1

authored 2 papers about 1 year ago

RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation

Paper • 2412.10543 • Published Dec 13, 2024 • 1

Marconi: Prefix Caching for the Era of Hybrid LLMs

Paper • 2411.19379 • Published Nov 28, 2024 • 1