zijie tian
zijie-tian
AI & ML interests
Storage for AI
Recent Activity
upvoted
a
paper
11 days ago
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers
in LLMs
upvoted
a
paper
about 2 months ago
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior
Accuracy Preservation
upvoted
a
paper
2 months ago
Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation
of Attention Heads
Organizations
models
0
None public yet
datasets
0
None public yet