Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Paper • 2306.16410 • Published Jun 28, 2023 • 28
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions Paper • 2306.06212 • Published Jun 9, 2023 • 9
FasterViT: Fast Vision Transformers with Hierarchical Attention Paper • 2306.06189 • Published Jun 9, 2023 • 30