InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published Dec 12, 2024 • 94
Art-Free Generative Models: Art Creation Without Graphic Art Knowledge Paper • 2412.00176 • Published Nov 29, 2024 • 8
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation Paper • 2407.12489 • Published Jul 17, 2024
DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction Paper • 2308.15536 • Published Aug 29, 2023
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM Paper • 2411.04954 • Published Nov 7, 2024 • 8
Efficient Detection of Toxic Prompts in Large Language Models Paper • 2408.11727 • Published Aug 21, 2024 • 13
ZePo: Zero-Shot Portrait Stylization with Faster Sampling Paper • 2408.05492 • Published Aug 10, 2024 • 7
P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering Paper • 2401.09266 • Published Jan 17, 2024
SP$^2$OT: Semantic-Regularized Progressive Partial Optimal Transport for Imbalanced Clustering Paper • 2404.03446 • Published Apr 4, 2024
Cascaded Sparse Feature Propagation Network for Interactive Segmentation Paper • 2203.05145 • Published Mar 10, 2022
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance Paper • 2401.16465 • Published Jan 29, 2024 • 12
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Paper • 2401.15687 • Published Jan 28, 2024 • 23