hangyu guo

Rosiness

https://github.com/pygh0er

pygh0er

AI & ML interests

Natural Language Processing

Organizations

None yet

Rosiness's activity

upvoted 2 papers 1 day ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 2 days ago • 49

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 6 days ago • 41

upvoted 3 papers 9 days ago

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Paper • 2503.18923 • Published 9 days ago • 11

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 9 days ago • 27

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published 13 days ago • 47

upvoted 2 papers 22 days ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 23 days ago • 34

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 22 days ago • 60

upvoted 2 papers about 1 month ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 27

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 99

upvoted 2 papers about 2 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 46

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 103

upvoted a paper 2 months ago

Taming Teacher Forcing for Masked Autoregressive Video Generation

Paper • 2501.12389 • Published Jan 21 • 10

authored a paper 4 months ago

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published Dec 2, 2024 • 6

upvoted a paper 5 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 34

authored a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8

upvoted a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8

commented on a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8

authored a paper 6 months ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published Sep 23, 2024 • 29

upvoted a paper 6 months ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published Sep 23, 2024 • 29

updated a model 9 months ago

Rosiness/GeoGPT4V-InternVL-Chat-40B

Feature Extraction • Updated Jun 28, 2024 • 6 • 1