Yang Shi

DogNeverSleep

15 48 2

https://FrankYang-17.github.io/

FrankYang-17

AI & ML interests

👨🏻‍🎓PhD student at Peking University

Recent Activity

authored a paper 5 days ago

CapRiCorn-1K: A Comprehensive Benchmark for Video Captioning and Subject Referential Consistency Across Temporal Scales

authored a paper 5 days ago

DOPD: Dual On-policy Distillation

upvoted a paper 6 days ago

DOPD: Dual On-policy Distillation

View all activity

Organizations

authored 2 papers 5 days ago

CapRiCorn-1K: A Comprehensive Benchmark for Video Captioning and Subject Referential Consistency Across Temporal Scales

Paper • 2606.21949 • Published 17 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 8 days ago • 102

upvoted a paper 6 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 8 days ago • 102

updated 2 datasets 17 days ago

ThinkingRM/Edit-Review

Viewer • Updated 16 days ago • 625 • 1.13k

ThinkingRM/Generation-Review

Viewer • Updated 17 days ago • 510 • 754

published 2 datasets 17 days ago

ThinkingRM/Generation-Review

Viewer • Updated 17 days ago • 510 • 754

ThinkingRM/Edit-Review

Viewer • Updated 16 days ago • 625 • 1.13k

published a dataset 28 days ago

KeyFrame-Review/Data-301-377

Viewer • Updated 28 days ago • 2.45k • 49

upvoted 2 papers about 1 month ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published Jun 3 • 39

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Paper • 2606.06042 • Published Jun 4 • 24

updated a dataset about 1 month ago

KeyFrame-Review/Review-Data

Viewer • Updated Jun 3 • 12.2k • 23

published a dataset about 1 month ago

KeyFrame-Review/Review-Data

Viewer • Updated Jun 3 • 12.2k • 23

upvoted 2 papers about 1 month ago

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Paper • 2605.31336 • Published May 29 • 12

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Paper • 2605.30263 • Published May 28 • 59

authored a paper about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

upvoted a paper about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

submitted a paper to Daily Papers about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

upvoted a paper about 1 month ago

Channel-wise Vector Quantization

Paper • 2605.26089 • Published May 25 • 15

authored a paper about 2 months ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published May 21 • 46

upvoted a paper about 2 months ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published May 21 • 46

Yang Shi

AI & ML interests

Recent Activity

Organizations

DogNeverSleep's activity