arxiv:2501.13629
Yeyun Gong
yegong
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Optimizing Large Language Model Training Using FP4 Quantization
authored a paper 11 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
upvoted a paper 11 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Organizations
None yet
models
None public yet
datasets
None public yet