Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published 24 days ago • 13
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published 18 days ago • 90
OLMo 2 Collection Artifacts for the second set of OLMo models. • 27 items • Updated 15 days ago • 107
SynthDetoxM Collection Data and models from NAACL 2025 paper "SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators" by Moskovskiy et al. • 4 items • Updated 30 days ago • 2
Knowledge Packing Collection Models and datasets from the paper: "How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?" https://arxiv.org/abs/2502.14502 • 9 items • Updated Feb 25 • 2
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 169
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20 • 88
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published Feb 18 • 68
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements Paper • 2408.15666 • Published Aug 28, 2024 • 11
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation Paper • 2407.14931 • Published Jul 20, 2024 • 22
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper • 2502.06394 • Published Feb 10 • 89
Methods for Detoxification of Texts for the Russian Language Paper • 2105.09052 • Published May 19, 2021 • 1
PseudoParaDetox Collection Models and datasets from the paper: "LLMs to Replace Crowdsourcing For Parallel Data Creation? The Case of Text Detoxification" by Moskovskiy et al. • 9 items • Updated 26 days ago • 1
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published Dec 19, 2024 • 54
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71