ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 Text Generation โข Updated Sep 17, 2024 โข 58 โข 46
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity โข Updated 13 days ago โข 94.1M โข โข 3.14k
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design Paper โข 2401.14112 โข Published Jan 25, 2024 โข 20