Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper โข 2405.10300 โข Published May 16, 2024 โข 29
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper โข 2311.06242 โข Published Nov 10, 2023 โข 93
Learning and Leveraging World Models in Visual Representation Learning Paper โข 2403.00504 โข Published Mar 1, 2024 โข 33