arxiv:2411.10958
Jianfei Chen
surfingtomchen
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
SageAttention2 Technical Report: Accurate 4 Bit Attention for
Plug-and-play Inference Acceleration
authored
a paper
3 months ago
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference
Acceleration
Organizations
None yet