Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs Paper • 2503.16870 • Published 25 days ago • 5 • 2