Submitted by akhaliq 51 LongVILA: Scaling Long-Context Visual Language Models for Long Videos · 18 authors 3
Submitted by NCJ 33 MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model · 12 authors 3
Submitted by akhaliq 17 Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data · 6 authors 3
Submitted by Study-is-happy 15 NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices · 4 authors 2
Submitted by akhaliq 13 SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views · 7 authors 2
Submitted by canyuchen 12 Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges · 3 authors 2
Submitted by akhaliq 11 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering · 7 authors 2
Submitted by akhaliq 6 Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models · 27 authors 2