SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 8 days ago • 158
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published Mar 7 • 55
KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language Paper • 2503.23730 • Published 16 days ago • 4
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper • 2503.16365 • Published 26 days ago • 38
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 7 days ago • 141k • • 1.12k