Synthetic Video Enhances Physical Fidelity in Video Synthesis Paper • 2503.20822 • Published 11 days ago • 15
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness Paper • 2503.21755 • Published 9 days ago • 30
EgoLife Collection CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 30 days ago • 16
🔥🔥 Introducing Ola! A state-of-the-art omni-modal understanding model with an advanced progressive modality alignment strategy! Ola ranks #1 on the OpenCompass Leaderboard (<10B). 📜 Paper: https://arxiv.org/abs/2502.04328 🛠️ Code: https://github.com/Ola-Omni/Ola 🛠️ We have fully released our video & audio training data and our intermediate image & video models at THUdyh/ola-67b8220eb93406ec87aeec37. Try building your own powerful omni-modal model with our data and models!