The GAN is dead; long live the GAN! A Modern GAN Baseline • Paper • 2501.05441 • Published 17 days ago • 84
VideoRAG: Retrieval-Augmented Generation over Video Corpus • Paper • 2501.05874 • Published 16 days ago • 66
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature • Paper • 2501.07171 • Published 13 days ago • 49
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data • Paper • 2501.08167 • Published 12 days ago • 6
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published 12 days ago • 268
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents • Paper • 2501.08828 • Published 11 days ago • 28
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization • Paper • 2412.18525 • Published Dec 24, 2024 • 70