-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 23 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 50 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 51
Jaehyun Jun
btjhjeon
AI & ML interests
Multimodal
Recent Activity
updated
a collection
about 4 hours ago
Multimodal Reasoning
updated
a collection
about 4 hours ago
Multimodal Reasoning
updated
a collection
1 day ago
Multimodal Benchmarks
Organizations
Collections
11
models
None public yet
datasets
None public yet