Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Zhihui's activity

upvoted a paper about 3 hours ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published 3 days ago • 17

liked 2 datasets 6 days ago

nebius/SWE-bench-extra

Viewer • Updated 16 days ago • 6.38k • 587 • 41

open-r1/codeforces-cots

Viewer • Updated 2 days ago • 238k • 3.31k • 74

liked a model 7 days ago

RekaAI/reka-flash-3

Updated 6 days ago • 3.56k • 295

liked a dataset 7 days ago

open-r1/codeforces

Viewer • Updated 8 days ago • 10k • 769 • 24

liked a Space 24 days ago

Predict Memory

🧮

Calculate memory usage from model configurations

liked a Space 25 days ago

2.3k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 27 days ago

HuggingFaceH4/aime_2024

Viewer • Updated Jan 26 • 30 • 19.5k • 22

liked a dataset 28 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated 29 days ago • 450k • 52.4k • 514

New activity in Zhihui/CTRL-32B 29 days ago

Add library_name and pipeline_tag metadata

#1 opened about 1 month ago by nielsr

liked a dataset about 1 month ago

allenai/RLVR-GSM-MATH-IF-Mixed-Constraints

Viewer • Updated Nov 26, 2024 • 29.9k • 1.04k • 20

authored a paper about 1 month ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted a paper about 1 month ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 47

liked a model about 1 month ago

Zhihui/CTRL-32B

Text Generation • Updated 29 days ago • 124 • 4

updated a model about 1 month ago

Zhihui/CTRL-32B

Text Generation • Updated 29 days ago • 124 • 4

upvoted a collection about 1 month ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robots • 323 items • Updated about 13 hours ago • 48

upvoted a paper about 1 month ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

commented a paper about 1 month ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

published a model about 1 month ago

Zhihui/CTRL-32B

Text Generation • Updated 29 days ago • 124 • 4

liked a dataset about 1 month ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 5.63k • 123