Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective Paper • 2404.09991 • Published Apr 15, 2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21 • 2
Sonata: Self-Supervised Learning of Reliable Point Representations Paper • 2503.16429 • Published 26 days ago • 11
Audiobox: Unified Audio Generation with Natural Language Prompts Paper • 2312.15821 • Published Dec 25, 2023 • 17
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Paper • 2111.09296 • Published Nov 17, 2021 • 3
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation Paper • 2203.15643 • Published Mar 29, 2022 • 1
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text Paper • 1908.06177 • Published Aug 16, 2019
Learning an Unreferenced Metric for Online Dialogue Evaluation Paper • 2005.00583 • Published May 1, 2020