arxiv:2501.11873
Ivan Titov
Ivanchoo
Recent Activity
Authored a paper 12 days ago:
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Authored a paper 6 months ago:
Layerwise Recurrent Router for Mixture-of-Experts