Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs Paper • 2503.16870 • Published 27 days ago • 5