- Gemini: A Family of Highly Capable Multimodal Models
  Paper • 2312.11805 • Published • 44
- Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
  Paper • 2312.13314 • Published • 7
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 257
- Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
  Paper • 2312.09911 • Published • 53
Collections
Collections including paper arxiv:2412.09626