One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5 • 229
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 174
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Paper • 2406.17660 • Published Jun 25, 2024 • 5 • 3
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Paper • 2406.17660 • Published Jun 25, 2024 • 5