Multimodal Art Projection

community

https://m-a-p.ai

multimodal-art-projection

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

wanng updated a dataset 8 minutes ago

m-a-p/PIN-100M

MING-ZCH authored a paper 2 days ago

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

wwwbxy123 authored a paper 2 days ago

Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm

View all activity

m-a-p's activity

wanng

updated a dataset 8 minutes ago

m-a-p/PIN-100M

Viewer • Updated 8 minutes ago • 68.1k • 2.02k • 2

MING-ZCH

authored a paper 2 days ago

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

Paper • 2406.05862 • Published Jun 9 • 4

wwwbxy123

authored 3 papers 2 days ago

Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm

Paper • 2409.07226 • Published Sep 11

Towards Rationality in Language and Multimodal Agents: A Survey

Paper • 2406.00252 • Published Jun 1

MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark

Paper • 2409.18216 • Published Sep 26

Liam-Liu

authored a paper 7 days ago

OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving

Paper • 2412.10734 • Published 11 days ago

Liam-Liu

authored a paper 14 days ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published 15 days ago • 20

wenhu

authored a paper 16 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 45

yuexiang96

authored 4 papers 16 days ago

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Paper • 2410.17250 • Published Oct 22 • 14

Long Context Alignment with Short Instructions and Synthesized Positions

Paper • 2405.03939 • Published May 7

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10

Machine Unlearning of Pre-trained Large Language Models

Paper • 2402.15159 • Published Feb 23

aaabiao

authored a paper 16 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 45

yuexiang96

authored a paper 16 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 45

yuexiang96

authored a paper 19 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 20 days ago • 43

CheeryLJH

authored a paper 22 days ago

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published 22 days ago • 6

wenhu

authored a paper 22 days ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published 24 days ago • 26

zhangysk

authored a paper 22 days ago

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published 22 days ago • 6

zhangysk

authored a paper about 1 month ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11 • 45

SivilTaram

authored a paper about 1 month ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20 • 15