ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization Paper • 2502.02631 • Published Feb 4 • 3
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 4 days ago • 115k • 1.06k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 8 days ago • 1.1M • 1.27k
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 150