Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 3 days ago • 40
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 7 days ago • 164
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 5 days ago • 46
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 16 days ago • 28
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text Paper • 2501.15654 • Published 14 days ago • 9
Histoires Morales: A French Dataset for Assessing Moral Alignment Paper • 2501.17117 • Published 12 days ago • 3
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 18 days ago • 63
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer Paper • 2406.16620 • Published Jun 24, 2024 • 2
NeuralSVG: An Implicit Representation for Text-to-Vector Generation Paper • 2501.03992 • Published Jan 7 • 1
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 19 days ago • 22
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper • 2501.10020 • Published 24 days ago • 22
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 26 days ago • 56
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper • 2501.08994 • Published 25 days ago • 15
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 24 days ago • 13