Kanzhi Cheng's picture

4 13 3

Kanzhi Cheng

cckevinn

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

upvoted a paper 26 days ago

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

upvoted a paper 27 days ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

View all activity

Organizations

cckevinn's activity

upvoted 2 papers 26 days ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published 30 days ago • 14

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published 28 days ago • 49

upvoted a paper 27 days ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published about 1 month ago • 24

upvoted 3 papers 2 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published Jan 30 • 25

upvoted a paper 3 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89

upvoted a collection 4 months ago

OS-Genesis

11 items • Updated Jan 6 • 6

upvoted a collection 5 months ago

Symbol-LLM

4 items • Updated Nov 11, 2024 • 5

upvoted 2 papers 5 months ago

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 51

upvoted a paper 6 months ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 33

upvoted a paper about 1 year ago

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

Paper • 2403.14734 • Published Mar 21, 2024 • 21