1 6

Yuhui Xu

yuhuixu

https://yuhuixu1993.github.io/

yuhuixu1993

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

authored a paper 7 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

upvoted a paper 7 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

View all activity

Organizations

None yet

yuhuixu's activity

authored 2 papers 7 days ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 44

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 10 days ago • 33

upvoted a paper 7 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 10 days ago • 33

commented a paper 7 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 10 days ago • 33 •

updated a model 18 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 18 days ago • 8

published a model 18 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 18 days ago • 8

updated a model 18 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 18 days ago • 6

published a model 18 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 18 days ago • 6

updated a model 18 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 18 days ago • 8

published a model 18 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 18 days ago • 8

upvoted an article 18 days ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

18 days ago

• 61

updated 3 models 27 days ago

upvoted 2 papers 4 months ago

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 46

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored a paper 4 months ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored 3 papers 6 months ago

Latency-Aware Differentiable Neural Architecture Search

Paper • 2001.06392 • Published Jan 17, 2020

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Paper • 1907.05737 • Published Jul 12, 2019

Trained Rank Pruning for Efficient Deep Neural Networks

Paper • 1812.02402 • Published Dec 6, 2018 • 1