Xiaojian Ma

jeasinema

http://jeasinema.github.io

jeasinema's activity

liked a dataset 10 days ago

jasonzhango/SPAR-7M

Preview • Updated 15 days ago • 50 • 2

authored a paper 23 days ago

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Paper • 2503.16365 • Published 24 days ago • 38

liked a model about 1 month ago

PengxiangLi/MAT

Visual Question Answering • Updated Mar 3 • 15 • 2

liked a dataset 3 months ago

agibot-world/AgiBotWorld-Alpha

Viewer • Updated 17 minutes ago • 20M • 21.8k • 186

upvoted a paper 4 months ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 33

liked a Space 5 months ago

MoGe

🏆

MoGe live demo

authored a paper 6 months ago

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23, 2024 • 52

liked 2 models 6 months ago

LanguageBind/Open-Sora-Plan-v1.3.0

Text-to-Video • Updated Dec 5, 2024 • 70

genmo/mochi-1-preview

Text-to-Video • Updated Dec 18, 2024 • 23.9k • 1.2k

upvoted a paper 6 months ago

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement

Paper • 2410.15633 • Published Oct 21, 2024 • 7

liked a model 6 months ago

MeissonFlow/Meissonic

Text-to-Image • Updated Dec 5, 2024 • 49 • 102

upvoted a paper 6 months ago

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Paper • 2410.01912 • Published Oct 2, 2024 • 14

authored a paper 8 months ago

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7, 2024 • 8

updated a Space 8 months ago

UltraEdit SD3

🖼

Edit images using text prompts and masks

liked 2 datasets 9 months ago

BleachNick/UltraEdit_500k

Viewer • Updated Jul 22, 2024 • 500k • 4.88k • 13

BleachNick/UltraEdit_Region_Based_100k

Viewer • Updated Jul 22, 2024 • 108k • 736 • 8

liked a model 9 months ago

BleachNick/SD3_UltraEdit_w_mask

Text-to-Image • Updated Jun 30, 2024 • 1.15k • 12

authored a paper 9 months ago

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Paper • 2407.05282 • Published Jul 7, 2024 • 15

liked a Space 9 months ago

UltraEdit SD3

🖼

Edit images using text prompts and masks

authored a paper 9 months ago

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27, 2024 • 13