arxiv:2406.12375
Haoze Wu
WaitHZ
AI & ML interests
Modular DL, Complex Reasoning
Recent Activity
upvoted
a
paper
about 15 hours ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient
Language Models
upvoted
a
paper
about 16 hours ago
Autonomy-of-Experts Models
commented on
a paper
about 16 hours ago
Autonomy-of-Experts Models
Organizations
None yet
Papers
1
models
None public yet
datasets
None public yet