Collections

Discover the best community collections!

Collections including paper arxiv:2412.04432
VisionLM
Collection by 5 days ago
video
Collection by about 6 hours ago
video LM
Collection by 1 day ago
Video
Collection by 12 days ago
Unified MLLM
Unified model that generate Text, Image, Video
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
daily papers
Collection by 2 days ago