21 48 16

Yuhao Dong

THUdyh

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Synthetic Video Enhances Physical Fidelity in Video Synthesis

upvoted a paper 9 days ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

liked a dataset 11 days ago

Osilly/Vision-R1-cold

View all activity

Organizations

THUdyh's activity

upvoted a paper 3 days ago

Synthetic Video Enhances Physical Fidelity in Video Synthesis

Paper • 2503.20822 • Published 11 days ago • 15

upvoted a paper 9 days ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published 9 days ago • 30

liked a dataset 11 days ago

Osilly/Vision-R1-cold

Preview • Updated 13 days ago • 170 • 7

authored a paper 30 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 38

upvoted a paper 30 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 38

updated a Space about 1 month ago

Ola

📊

Generate text and audio responses from images and videos

New activity in THUdyh/Ola about 1 month ago

The Gradio demo encountered a Runtime Error.

#2 opened about 1 month ago by

danieldeng

New activity in THUdyh/Ola-7b about 1 month ago

Low bit version issues

#7 opened about 1 month ago by

Jilt

upvoted a collection about 1 month ago

EgoLife

Collection

CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 30 days ago • 16

New activity in THUdyh/Oryx-1.5-7B about 1 month ago

Improve Model Card: Correct pipeline tag, add library name and project page link

#1 opened about 1 month ago by

nielsr

New activity in THUdyh/Oryx-ViT about 1 month ago

Improve model card

#2 opened about 1 month ago by

nielsr

New activity in THUdyh/Oryx-34B about 1 month ago

Improve Model Card: Correct pipeline tag and add library name

#1 opened about 1 month ago by

nielsr

liked 2 datasets about 1 month ago

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated 11 days ago • 251k • 6.33k • 163

PrimeIntellect/SYNTHETIC-1-SFT-Data

Viewer • Updated Feb 21 • 894k • 957 • 27

New activity in THUdyh/Oryx-ViT about 1 month ago

visual embedding and text embedding projection

#1 opened about 1 month ago by

MonoLeon

New activity in THUdyh/Ola-Image about 1 month ago

Add pipeline tag

#1 opened about 1 month ago by

nielsr

New activity in THUdyh/Ola-Video about 1 month ago

Add pipeline tag

#1 opened about 1 month ago by

nielsr

New activity in THUdyh/Ola_speech_encoders about 1 month ago

Add model card

#1 opened about 1 month ago by

nielsr

New activity in THUdyh/Ola-Data about 1 month ago

Add dataset card

#2 opened about 1 month ago by

nielsr

posted an update about 1 month ago

Post

2126

🔥🔥Introducing Ola! State-of-the-art omni-modal understanding model with advanced progressive modality alignment strategy!
Ola ranks #1 on OpenCompass Leaderboard (<10B)
.
📜Paper: https://arxiv.org/abs/2502.04328
🛠️Code: https://github.com/Ola-Omni/Ola

🛠️We have fully released our video&audio training data, intermediate image&video model at THUdyh/ola-67b8220eb93406ec87aeec37. Try to build your own powerful omni-modal model with our data and models!