Hugging Face
Yuezhou Hu
yuezhouhu
6 followers · 4 following
https://yuezhouhu.github.io/
AI & ML interests
My research interests include efficient machine learning, particularly efficient training and inference.
Recent Activity
upvoted a paper 6 days ago
SLA2: Sparse-Linear Attention with Learnable Routing and QAT
authored a paper 11 days ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
upvoted a paper 11 days ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
yuezhouhu's models: 12
yuezhouhu/RCD-LLaDA-8B-Instruct • 1B • Updated 29 days ago • 23
yuezhouhu/SeqD-LLaDA-8B-Instruct • 1B • Updated 29 days ago • 16
yuezhouhu/RCD-SDAR-8B-b64-Thinking • 8B • Updated Jan 30 • 12
yuezhouhu/RCD-SDAR-8B-b32-Thinking • 8B • Updated Jan 30 • 6
yuezhouhu/RCD-SDAR-4B-b64-Thinking • 4B • Updated Jan 30 • 30
yuezhouhu/RCD-SDAR-4B-b32-Thinking • 4B • Updated Jan 30 • 31
yuezhouhu/SeqD-SDAR-8B-b64-Thinking • 8B • Updated Jan 30 • 15
yuezhouhu/SeqD-SDAR-8B-b32-Thinking • 8B • Updated Jan 30 • 11
yuezhouhu/SeqD-SDAR-4B-b64-Thinking • 4B • Updated Jan 30 • 10
yuezhouhu/SeqD-SDAR-4B-b32-Thinking • 4B • Updated Jan 30 • 8
yuezhouhu/SeqD-SDAR-1.7B-b64-Thinking • 2B • Updated Jan 30 • 34
yuezhouhu/SeqD-SDAR-1.7B-b32-Thinking • 2B • Updated Jan 30 • 97