JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework Paper • 2502.13407 • Published Feb 19 • 1
DEFT: Differentiable Branched Discrete Elastic Rods for Modeling Furcated DLOs in Real-Time Paper • 2502.15037 • Published Feb 20
Differentiable Discrete Elastic Rods for Real-Time Modeling of Deformable Linear Objects Paper • 2406.05931 • Published Jun 9, 2024
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Paper • 2501.02955 • Published Jan 6 • 45
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Paper • 2412.21059 • Published Dec 30, 2024 • 19
A Survey on Dialog Management: Recent Advances and Challenges Paper • 2005.02233 • Published May 5, 2020
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents Paper • 2305.13040 • Published May 22, 2023 • 2
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection Paper • 2111.14592 • Published Nov 29, 2021 • 1
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation Paper • 2310.07968 • Published Oct 12, 2023
Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking Paper • 2106.00291 • Published Jun 1, 2021
RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning Paper • 2409.14674 • Published Sep 23, 2024 • 44
CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published Aug 29, 2024 • 58
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions Paper • 2406.09264 • Published Jun 13, 2024 • 1
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework Paper • 2104.04182 • Published Apr 9, 2021