8 21 4

Canyu Chen

canyuchen

https://canyuchen.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

upvoted a paper 29 days ago

ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

View all activity

Organizations

upvoted a paper 25 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26 • 109

upvoted a paper 29 days ago

ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction

Paper • 2511.20937 • Published Nov 26 • 15

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20 • 108

upvoted a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 81

upvoted a paper 2 months ago

LLMs Can Get "Brain Rot"!

Paper • 2510.13928 • Published Oct 15 • 22

upvoted 3 papers 5 months ago

upvoted a paper 6 months ago

Spatial Mental Modeling from Limited Views

Paper • 2506.21458 • Published Jun 26 • 13

upvoted a paper 8 months ago

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Paper • 2504.20073 • Published Apr 24 • 12

upvoted 2 papers 10 months ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Paper • 2502.13233 • Published Feb 18 • 15

upvoted 3 papers about 1 year ago

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 39

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10, 2024 • 17

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21, 2024 • 55

upvoted 5 papers over 1 year ago

Can Editing LLMs Inject Harm?

Paper • 2407.20224 • Published Jul 29, 2024 • 3

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16, 2024 • 12

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 51

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 55

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 13

Canyu Chen

AI & ML interests

Recent Activity

Organizations

canyuchen's activity