Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding Paper • 2408.16272 • Published Aug 29
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web Paper • 2310.18340 • Published Oct 22, 2023
CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning Paper • 2404.09640 • Published Apr 15
OmniCreator: Self-Supervised Unified Generation with Universal Editing Paper • 2412.02114 • Published 23 days ago • 14
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting Paper • 2405.07472 • Published May 13
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs Paper • 2407.02157 • Published Jul 2
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published 23 days ago • 59
OmniCreator: Self-Supervised Unified Generation with Universal Editing Paper • 2412.02114 • Published 23 days ago • 14 • 3
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published 30 days ago • 35
OmniCreator: Self-Supervised Unified Generation with Universal Editing Paper • 2412.02114 • Published 23 days ago • 14