Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 3 days ago • 38
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Paper • 2503.04812 • Published 25 days ago • 13
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 17 days ago • 60
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 18 days ago • 34
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published 29 days ago • 26
CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom Paper • 2503.01836 • Published 25 days ago • 12
UniTok: A Unified Tokenizer for Visual Generation and Understanding Paper • 2502.20321 • Published 29 days ago • 29
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published 29 days ago • 28
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale Paper • 2502.16645 • Published Feb 23 • 22
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published Feb 20 • 45
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 150
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3 • 39
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 366
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 282