9 17 112

Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

OpenCoder-LLM/opc-sft-stage1

liked a dataset 13 days ago

agentica-org/DeepCoder-Preview-Dataset

liked a dataset 14 days ago

nvidia/OpenCodeReasoning

View all activity

Organizations

Zhihui's activity

upvoted a paper 15 days ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published 18 days ago • 30

upvoted a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

upvoted a paper 2 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 48

upvoted a collection 2 months ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 357 items • Updated 44 minutes ago • 52

upvoted a paper 2 months ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted 2 papers 4 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

upvoted a paper 5 months ago

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 11

upvoted a paper 7 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 63

upvoted 2 papers 9 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 163

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18, 2024 • 40

upvoted a paper 10 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20, 2024 • 13

upvoted an article 10 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 87

upvoted 2 papers 11 months ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 10

Calibrating Reasoning in Language Models with Internal Consistency

Paper • 2405.18711 • Published May 29, 2024 • 6

upvoted a collection 11 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 107 items • Updated 9 days ago • 99

upvoted a paper over 1 year ago

Silkie: Preference Distillation for Large Visual Language Models

Paper • 2312.10665 • Published Dec 17, 2023 • 11