2 15 8

ChengyouJia

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

upvoted a paper 19 days ago

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

upvoted a paper 20 days ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

View all activity

Organizations

None yet

ChengyouJia's activity

upvoted a paper 13 days ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published 14 days ago • 71

upvoted a paper 19 days ago

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published 22 days ago • 49

upvoted a paper 20 days ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published 23 days ago • 24

upvoted 3 papers about 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning

Paper • 2502.12054 • Published Feb 17 • 6

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

upvoted a paper 3 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89

upvoted a paper 4 months ago

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Paper • 2411.17176 • Published Nov 26, 2024 • 23

upvoted a collection 4 months ago

ChatGen

Collection

ChatGen series models • 7 items • Updated Nov 29, 2024 • 2

upvoted a collection 5 months ago

OS-Atlas

Collection

OS-Atlas series models • 7 items • Updated Nov 18, 2024 • 13

upvoted 2 papers 5 months ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 50

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 32

upvoted a collection 6 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 26 days ago • 300

upvoted a paper 9 months ago

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Paper • 2407.05975 • Published Jul 8, 2024 • 37

upvoted a paper about 1 year ago

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

Paper • 2403.14734 • Published Mar 21, 2024 • 21