Hanshi's picture

2 3 1

Hanshi

preminstrel

·

https://preminstrel.com

AI & ML interests

ML

Organizations

None yet

preminstrel's activity

upvoted a paper about 1 hour ago

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published 1 day ago • 1

upvoted a paper about 20 hours ago

Fast Best-of-N Decoding via Speculative Rejection

Paper • 2410.20290 • Published 3 days ago • 7

upvoted a paper 6 months ago

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Paper • 2404.11912 • Published Apr 18 • 16