Zhenran Xu's picture

Zhenran Xu

imryanxu

·

AI & ML interests

fishing in lab while working on language agents

Recent Activity

upvoted a paper 22 days ago

New Trends for Modern Machine Translation with Large Reasoning Models

updated a dataset 23 days ago

HIT-TMG/YiZhao

upvoted a collection about 1 month ago

View all activity

Organizations

imryanxu's activity

upvoted a paper 22 days ago

New Trends for Modern Machine Translation with Large Reasoning Models

Paper • 2503.10351 • Published 22 days ago • 22

upvoted a collection about 1 month ago

YiZhao Dataset

Data and filtering models of our financial open-source YiZhao Dataset. • 5 items • Updated Jan 10 • 1

upvoted 3 papers about 1 month ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 192

AIDE: AI-Driven Exploration in the Space of Code

Paper • 2502.13138 • Published Feb 18 • 7

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 99

upvoted 2 collections about 2 months ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated 10 days ago • 59

DeepSeek-VL2

5 items • Updated Feb 9 • 72

upvoted 3 papers 2 months ago

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 69

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published Jan 20 • 28

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published Jan 2 • 14

upvoted a collection 2 months ago

KaLM-embedding

11 items • Updated 24 days ago • 24

upvoted 3 papers 2 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 25

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 56

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 48

upvoted 4 papers 3 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 283

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 22

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 12

upvoted 2 papers 4 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published Dec 12, 2024 • 10

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

Paper • 2412.08687 • Published Dec 11, 2024 • 13