WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs Paper • 2502.04326 • Published Feb 6 • 2
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners Paper • 2408.16768 • Published Aug 29, 2024 • 28