Dongwon Jo
dongwonjo
AI & ML interests
Efficient AI, Model Compression, Quantization, Pruning, Generative Model, Large Language Model, Diffusion
Recent Activity
upvoted a paper about 1 month ago
Squeezing Large-Scale Diffusion Models for Mobile upvoted a paper about 1 month ago
SLEB: Streamlining LLMs through Redundancy Verification and Elimination
of Transformer Blocks upvoted a paper about 1 month ago
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning