Daily Papers

by AK and the research community

Mar 19

Submitted by

nebulae09

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

·
12 authors

Submitted by

ZhangRC

RWKV-7 "Goose" with Expressive Dynamic State Evolution

·
15 authors

Submitted by

ZechenBai

Impossible Videos

·
3 authors

Submitted by

carboncoo

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

·
8 authors

Submitted by

ZhaoyangLyu

Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation

·
12 authors

Submitted by

mathfinder

Frac-Connections: Fractional Extension of Hyper-Connections

·
8 authors

Submitted by

akhaliq

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

·
35 authors

Submitted by

Lingaaaaaaa

Temporal Consistency for LLM Reasoning Process Error Identification

·
7 authors

Submitted by

kpzhang996

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

·
9 authors

Submitted by

cckevinn

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

·
10 authors

Submitted by

akhaliq

Measuring AI Ability to Complete Long Tasks

·
25 authors

Submitted by

yifanzhang114

Aligning Multimodal LLM with Human Preference: A Survey

·
17 authors

Submitted by

akhaliq

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

·
39 authors

Submitted by

kpzhang996

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

·
11 authors

Submitted by

jacklishufan

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

·
7 authors

Submitted by

Mingtongz

KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation

·
3 authors

Submitted by

yuwendu

RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation

·
9 authors
