-
How Far Are We from Intelligent Visual Deductive Reasoning?
Paper • 2403.04732 • Published • 19 -
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper • 2403.07508 • Published • 74 -
DragAnything: Motion Control for Anything using Entity Representation
Paper • 2403.07420 • Published • 13 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 31
Jue Zhang
JueZhang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
29 days ago
VisualLens: Personalization through Visual History
upvoted
a
paper
about 1 month ago
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning
for Web Agents
Organizations
None yet
Collections
3
models
None public yet
datasets
None public yet