Jixuan Chen's picture

Jixuan Chen

Mayome

·

https://chenjix.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 15 hours ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

authored a paper about 15 hours ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

authored a paper about 15 hours ago

OpenCUA: Open Foundations for Computer-Use Agents

View all activity

Organizations

authored 5 papers about 15 hours ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024 • 2

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25, 2025 • 33

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 33

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 21

DeliveryBench: Can Agents Earn Profit in Real World?

Paper • 2512.19234 • Published Dec 22, 2025

authored a paper about 16 hours ago

CocoaBench: Evaluating Unified Digital Agents in the Wild

Paper • 2604.11201 • Published 4 days ago • 32

authored a paper 11 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19, 2025 • 45

authored a paper about 1 year ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 61

authored a paper over 1 year ago

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15, 2024 • 7

authored a paper almost 2 years ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 52