4 18 2

Xiuyu Li

xiuyul

https://xiuyuli.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

upvoted a paper 3 months ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

upvoted a paper 3 months ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

View all activity

Organizations

upvoted a paper 21 days ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 23 days ago • 33

upvoted 2 papers 3 months ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Paper • 2512.05033 • Published Dec 4, 2025 • 17

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

updated a model 4 months ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step100-merged

Text Generation • 4B • Updated Nov 10, 2025 • 1

published a model 4 months ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step100-merged

Text Generation • 4B • Updated Nov 10, 2025 • 1

updated a model 4 months ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step80-merged

Text Generation • 4B • Updated Nov 9, 2025 • 1

published 2 models 4 months ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step80-merged

Text Generation • 4B • Updated Nov 9, 2025 • 1

xiuyul/deepcoder-Qwen-Qwen3-4B-Instruct-2507-32rank-4e-05lr-8group-128batch-1_0temp-seed0-merged

Text Generation • 4B • Updated Nov 3, 2025 • 4

updated a model 4 months ago

xiuyul/deepcoder-Qwen-Qwen3-4B-Instruct-2507-32rank-4e-05lr-8group-128batch-1_0temp-seed0-merged

Text Generation • 4B • Updated Nov 3, 2025 • 4

upvoted a paper 6 months ago

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

Paper • 2508.10395 • Published Aug 14, 2025 • 42

updated a dataset 6 months ago

Parallel-Reasoning/apr_sft_data

Preview • Updated Aug 15, 2025 • 22 • 1

published a dataset 6 months ago

Parallel-Reasoning/apr_sft_data

Preview • Updated Aug 15, 2025 • 22 • 1

updated a dataset 6 months ago

Parallel-Reasoning/sosp_sft_data

Viewer • Updated Aug 15, 2025 • 500k • 28

published a dataset 6 months ago

Parallel-Reasoning/sosp_sft_data

Viewer • Updated Aug 15, 2025 • 500k • 28

updated a dataset 6 months ago

Parallel-Reasoning/countdown_problems

Viewer • Updated Aug 15, 2025 • 501k • 120

published a dataset 6 months ago

Parallel-Reasoning/countdown_problems

Viewer • Updated Aug 15, 2025 • 501k • 120

authored a paper 8 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19, 2025 • 16

upvoted 2 papers 8 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19, 2025 • 16

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published Jun 11, 2025 • 55

updated a model 9 months ago

LM-Parallel/llama-hsp-v3n5-b6subb8_th0_9-8k-fla_num_subcall_cond

0.3B • Updated Jun 10, 2025 • 1

Xiuyu Li

AI & ML interests

Recent Activity

Organizations

xiuyul's activity