Jiwon Song's picture

Jiwon Song

jiwonsong

·

AI & ML interests

AI Compression & Acceleration

Recent Activity

upvoted a paper about 2 months ago

Retrospective Sparse Attention for Efficient Long-Context Generation

updated a collection about 2 months ago

upvoted a paper about 2 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

View all activity

Organizations

authored a paper 2 months ago

LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning

Paper • 2510.14211 • Published Oct 16 • 7

authored a paper 7 months ago

Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Paper • 2505.13866 • Published May 20 • 17

authored a paper 11 months ago

FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published Feb 3 • 18

authored a paper over 1 year ago

SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Paper • 2402.09025 • Published Feb 14, 2024 • 9