FreeGaussian: Annotation-free Controllable 3D Gaussian Splats with Flow Derivatives Paper • 2410.22070 • Published Oct 29, 2024
Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published 2 days ago • 26
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published 2 days ago • 26
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 21 days ago • 129
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 111
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control Paper • 2406.16038 • Published Jun 23, 2024 • 1
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Paper • 2501.15830 • Published Jan 27 • 14 • 1
Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction Paper • 2303.18125 • Published Mar 31, 2023
GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting Paper • 2311.11700 • Published Nov 20, 2023 • 4
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control Paper • 2406.16038 • Published Jun 23, 2024 • 1
Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface Paper • 2409.19499 • Published Sep 29, 2024
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Paper • 2501.15830 • Published Jan 27 • 14
Exploring the Potential of Encoder-free Architectures in 3D LMMs Paper • 2502.09620 • Published 28 days ago • 25