Delin Qu

delinqu

https://delinqu.github.io/

AI & ML interests

Embodied AI, 3D Vision

Recent Activity

authored a paper 1 day ago

FreeGaussian: Annotation-free Controllable 3D Gaussian Splats with Flow Derivatives

authored a paper 1 day ago

Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

delinqu's activity

upvoted a paper 1 day ago

UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Paper • 2503.08120 • Published 2 days ago • 26

upvoted an article 4 days ago

4D masks support in Transformers

Article • Jan 8, 2024 • 18

upvoted a paper 18 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

upvoted an article 20 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • Feb 4 • 111

upvoted a paper 21 days ago

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

Paper • 2406.16038 • Published Jun 23, 2024 • 1

upvoted a paper 24 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 27 days ago • 103

upvoted a collection 27 days ago

Foundation Vision-language-action Model

3 items • Updated 12 days ago • 3

upvoted 2 papers 27 days ago

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model

Paper • 2501.15830 • Published Jan 27 • 14

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Paper • 2502.09620 • Published 28 days ago • 25