Haoze Wu
WaitHZ
https://waithz.github.io/
HaozeWu7
WaitHZ
AI & ML interests
Modular DL, Complex Reasoning
Recent Activity
Upvoted a paper, about 20 hours ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Upvoted a paper, about 21 hours ago
Autonomy-of-Experts Models
Commented on a paper, about 21 hours ago
Autonomy-of-Experts Models
Organizations
None yet
WaitHZ's activity
Commented on a paper, about 21 hours ago
Autonomy-of-Experts Models
Paper • 2501.13074 • Published 2 days ago • 34 • 5
New activity in deepseek-ai/deepseek-moe-16b-base, 10 months ago
A little question about aux_loss • 2
#4 opened 11 months ago by WaitHZ
New activity in deepseek-ai/deepseek-moe-16b-base, 11 months ago
A little question about aux_loss • 2
#4 opened 11 months ago by WaitHZ