Gaojunyao

Jeoyal

AI & ML interests

None yet

Organizations

None yet

Gaojunyao's activity

upvoted a paper 6 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 7 days ago • 232

authored a paper 25 days ago

StyleShot: A Snapshot on Any Style

Paper • 2407.01414 • Published Jul 1, 2024 • 2

upvoted a paper 26 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 27 days ago • 34

upvoted 2 papers 4 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 101

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 55

liked a Space 4 months ago

StyleShot

📊

liked a model 4 months ago

uwnlp/jpeg-lm-reimplementation

Updated Aug 24, 2024 • 6 • 1

liked a Space 4 months ago

StyleShot

📊

liked a model 6 months ago

stabilityai/stable-diffusion-2-1

Text-to-Image • Updated Jul 5, 2023 • 946k • 3.96k

New activity in nowsyn/StyleShot 7 months ago

could you restart the demo?

#2 opened 8 months ago by HUG-NAN

500 error

#1 opened 9 months ago by loong

upvoted a paper 7 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

updated 2 models 7 months ago

Gaojunyao/StyleShot_lineart

Text-to-Image • Updated Sep 9, 2024 • 1

Gaojunyao/StyleShot

Text-to-Image • Updated Sep 9, 2024 • 14

liked a model 7 months ago

Leoxing/Live2Diff

Text-to-Video • Updated Jul 17, 2024 • 11

liked a model 9 months ago

MyNiuuu/MOFA-Video-Hybrid

Updated Jun 25, 2024 • 29

upvoted 2 papers 9 months ago

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 45

StyleShot: A Snapshot on Any Style

Paper • 2407.01414 • Published Jul 1, 2024 • 2

liked 2 Spaces 10 months ago

AnyControl

🏆

280

DynamiCrafter

🐨

Generate videos from images and text prompts