2 4 1

Zhuokai Zhao

zhuokai

https://zhuokai-zhao.com/

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

new activity 3 days ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8:Quantizer: Running into an error with quantization "TypeError: 'dict' object is not callable"

upvoted a paper 13 days ago

ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning

updated a model about 1 month ago

MoeReward/reward_lora_qwen_1_5_base

View all activity

Organizations

zhuokai's activity

New activity in meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 3 days ago

Quantizer: Running into an error with quantization "TypeError: 'dict' object is not callable"

#24 opened 4 days ago by

AaronVogler

upvoted a paper 13 days ago

ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning

Paper • 2503.22738 • Published 25 days ago • 15

updated a model about 1 month ago

MoeReward/reward_lora_qwen_1_5_base

Updated about 1 month ago • 7

published a model about 1 month ago

MoeReward/reward_lora_qwen_1_5_base

Updated about 1 month ago • 7

updated a model about 1 month ago

MoeReward/reward_qwen_1_5

Updated Mar 17 • 2

published a model about 1 month ago

MoeReward/reward_qwen_1_5

Updated Mar 17 • 2

updated a model about 1 month ago

MoeReward/reward_lora_qwen_1_5

Updated Mar 17 • 2

published a model about 1 month ago

MoeReward/reward_lora_qwen_1_5

Updated Mar 17 • 2

updated a model about 1 month ago

MoeReward/sft_full_param_qwen_1_5

Updated Mar 16 • 3

published a model about 1 month ago

MoeReward/sft_full_param_qwen_1_5

Updated Mar 16 • 3

authored a paper about 1 month ago

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Paper • 2503.07597 • Published Mar 10 • 2

published 2 models 2 months ago

zhuokai/InternVL2-2B-Open-R1-GRPO

Updated Feb 19

zhuokai/InternVL2-26B-Open-R1-GRPO

Updated Feb 19

upvoted a paper 7 months ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 14

authored a paper 7 months ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 14

upvoted a paper 9 months ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 52

liked a Space 10 months ago

MJ Bench Leaderboard

🥇

Display and filter multimodal model leaderboard results

upvoted a paper 10 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 57

authored a paper 10 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 57

authored a paper 12 months ago

HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding

Paper • 2403.00425 • Published Mar 1, 2024 • 1