2 55 126

Wenhao Chai

wchai

http://rese1f.github.io

AI & ML interests

computer vision, artificial intelligence

Recent Activity

upvoted a paper about 7 hours ago

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

upvoted an article about 8 hours ago

Open R1: Update #3

liked a model 6 days ago

ai21labs/AI21-Jamba-Large-1.6

View all activity

Organizations

wchai's activity

upvoted a paper about 7 hours ago

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published 5 days ago • 24

upvoted an article about 8 hours ago

Article

Open R1: Update #3

and 7 others •

about 17 hours ago

• 129

liked a model 6 days ago

ai21labs/AI21-Jamba-Large-1.6

Text Generation • Updated 6 days ago • 395 • 54

upvoted a paper 6 days ago

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published 7 days ago • 19

liked a model 7 days ago

Qwen/QwQ-32B

Text Generation • Updated 1 day ago • 208k • • 2.02k

upvoted a collection 8 days ago

C4AI Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 8 days ago • 62

authored a paper 12 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 13 days ago • 26

upvoted 2 papers 12 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 13 days ago • 26

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published 14 days ago • 57

upvoted a collection 16 days ago

QwQ

Collection

Qwen with Questions • 6 items • Updated 6 days ago • 77

liked 2 models 18 days ago

google/siglip2-so400m-patch14-384

Zero-Shot Image Classification • Updated 19 days ago • 589k • 13

google/siglip2-so400m-patch16-naflex

Zero-Shot Image Classification • Updated 19 days ago • 15.6k • 15

upvoted 2 papers 19 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 20 days ago • 178

Five A^{+} Network: You Only Need 9K Parameters for Underwater Image Enhancement

Paper • 2305.08824 • Published May 15, 2023 • 2

liked a Space 21 days ago

2.21k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 21 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

22 days ago

• 65

liked a model 21 days ago

BlinkDL/rwkv-6-world

Text Generation • Updated Nov 13, 2024 • 145

upvoted a paper 26 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 27 days ago • 32

liked a model about 1 month ago

simplescaling/s1-32B

Text Generation • Updated 14 days ago • 15.5k • 288

upvoted a paper about 1 month ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108