multi-token

community

AI & ML interests

None defined yet.

Recent Activity

Snyhlxde authored a paper about 1 month ago

TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs

Snyhlxde authored a paper about 1 month ago

ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

Snyhlxde authored a paper 2 months ago

GameArena: Evaluating LLM Reasoning through Live Computer Games

View all activity

multi-token's activity

Snyhlxde

authored 2 papers about 1 month ago

TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs

Paper • 2412.11242 • Published Dec 15, 2024 • 1

ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

Paper • 2502.00675 • Published Feb 2 • 1

Snyhlxde

authored a paper 2 months ago

GameArena: Evaluating LLM Reasoning through Live Computer Games

Paper • 2412.06394 • Published Dec 9, 2024

Viol2000

authored 3 papers 4 months ago

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Paper • 2406.05981 • Published Jun 10, 2024 • 16

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Paper • 2406.07368 • Published Jun 11, 2024 • 2

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 38

Viol2000

authored 2 papers 8 months ago

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 21

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Paper • 2402.02057 • Published Feb 3, 2024

Snyhlxde

authored 4 papers 10 months ago

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Paper • 2310.17752 • Published Oct 26, 2023 • 14

Online Speculative Decoding

Paper • 2310.07177 • Published Oct 11, 2023 • 2

CLLMs: Consistency Large Language Models

Paper • 2403.00835 • Published Feb 28, 2024 • 4

Optimizing Speculative Decoding for Serving Large Language Models Using Goodput

Paper • 2406.14066 • Published Jun 20, 2024 • 2