17 28 2

Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

authored a paper 3 days ago

GraphicBench: A Planning Benchmark for Graphic Design with Language Agents

authored a paper 3 days ago

Exploring Expert Failures Improves LLM Agent Tuning

commented on a paper 3 days ago

Exploring Expert Failures Improves LLM Agent Tuning

View all activity

Organizations

zhoutianyi's activity

authored 2 papers 3 days ago

GraphicBench: A Planning Benchmark for Graphic Design with Language Agents

Paper • 2504.11571 • Published 5 days ago

Exploring Expert Failures Improves LLM Agent Tuning

Paper • 2504.13145 • Published 3 days ago • 11

commented 2 papers 3 days ago

Exploring Expert Failures Improves LLM Agent Tuning

Paper • 2504.13145 • Published 3 days ago • 11 •

Exploring Expert Failures Improves LLM Agent Tuning

Paper • 2504.13145 • Published 3 days ago • 11 •

liked a dataset 3 days ago

umd-zhou-lab/ColorBench

Viewer • Updated 1 day ago • 5.81k • 85 • 3

authored 3 papers 4 days ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?

Paper • 2410.21259 • Published Oct 28, 2024 • 1

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 11 days ago • 45

upvoted a paper 4 days ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 11 days ago • 45

commented 3 papers 4 days ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 11 days ago • 45 •

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 11 days ago • 45 •

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 11 days ago • 45 •

authored 2 papers 5 days ago

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Paper • 2504.05520 • Published 13 days ago • 9

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 6 days ago • 37

upvoted 2 papers 5 days ago

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Paper • 2504.05520 • Published 13 days ago • 9

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 6 days ago • 37

commented a paper 5 days ago

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 6 days ago • 37 •

upvoted a paper 8 days ago

Towards Visual Text Grounding of Multimodal Large Language Model

Paper • 2504.04974 • Published 14 days ago • 15

authored 2 papers 10 days ago

Towards Visual Text Grounding of Multimodal Large Language Model

Paper • 2504.04974 • Published 14 days ago • 15

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published 10 days ago • 59