4 16 3

Kanzhi Cheng

cckevinn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Could Thinking Multilingually Empower LLM Reasoning?

authored a paper 4 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

upvoted a paper 5 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

View all activity

Organizations

cckevinn's activity

upvoted a paper about 15 hours ago

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published 5 days ago • 15

authored a paper 4 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 10 days ago • 52

upvoted a paper 5 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 10 days ago • 52

upvoted a paper 6 days ago

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Paper • 2504.10127 • Published 7 days ago • 16

upvoted 3 papers about 1 month ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published Mar 16 • 15

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published Mar 17 • 50

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

commented a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24 •

authored a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

liked a Space about 1 month ago

CapArena Auto 1

🥇

Display Leaderboard of LLM Model Evaluations

liked a Space about 2 months ago

ACL Pubcheck

📝

Check your paper for ACL guidelines

upvoted 2 papers 2 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted a paper 3 months ago

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published Jan 30 • 25

updated 2 datasets 3 months ago

OS-Copilot/OS-Genesis-web-data

Updated Mar 17 • 51 • 2

OS-Copilot/OS-Genesis-mobile-data

Viewer • Updated Mar 17 • 51.1k • 200 • 2

authored 4 papers 3 months ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Paper • 2401.10935 • Published Jan 17, 2024 • 4

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Paper • 2406.11736 • Published Jun 17, 2024 • 5

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89