Inui

Norm

https://normxu.github.io/

AI & ML interests

Video Diffusion; Large Language Model; Object Detection; OCR

Recent Activity

liked a model 13 days ago

hao9610/X2SAM

liked a model 27 days ago

deepseek-ai/DeepSeek-V4-Pro

liked a model about 1 month ago

tiiuae/Falcon-Perception

View all activity

Organizations

liked a model 13 days ago

hao9610/X2SAM

Updated 15 days ago • 4

liked a model 27 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 15 days ago • 3.82M • • 4.08k

liked a model about 1 month ago

tiiuae/Falcon-Perception

Mask Generation • 0.6B • Updated 9 days ago • 11.8k • 121

liked a model about 2 months ago

Logics-MLLM/Logics-Parsing-Omni

32B • Updated Apr 8 • 100 • 10

upvoted a paper about 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 184

liked a dataset 3 months ago

NoobEngineere/NSFW_Manga

Viewer • Updated Feb 23 • 596k • 323 • 8

upvoted a paper 4 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

liked a model 4 months ago

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated Jan 23 • 13k • 112

liked 2 datasets 5 months ago

wsdwJohn1231/DreamLIP_capion_csv_w_key

Viewer • Updated Dec 2, 2025 • 13M • 21 • 1

Jyuhamdik/RealSyn15M

Viewer • Updated Dec 18, 2025 • 15.2M • 121 • 1

upvoted 2 papers 7 months ago

Revisiting Multimodal Positional Encoding in Vision-Language Models

Paper • 2510.23095 • Published Oct 27, 2025 • 22

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published Oct 31, 2025 • 26

liked a model 7 months ago

meituan-longcat/LongCat-Flash-Omni

Any-to-Any • 561B • Updated Nov 11, 2025 • 46 • 110

upvoted a paper 7 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 514

liked a model 8 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 208k • 1.31k

liked a model 9 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 83.8k • 533

upvoted a paper 9 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 170

liked a model 9 months ago

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Jan 22 • 203k • 2.38k

upvoted a paper 9 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12, 2025 • 37

liked a model 9 months ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 67k • • 2.39k

Inui

AI & ML interests

Recent Activity

Organizations

Norm's activity