A dynamic parallel method for performance optimization on hybrid CPUs Paper • 2411.19542 • Published Nov 29, 2024
ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution Paper • 2408.15993 • Published Aug 28, 2024
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models Paper • 2402.16786 • Published Feb 26, 2024
Using Imperfect Surrogates for Downstream Inference: Design-based Supervised Learning for Social Science Applications of Large Language Models Paper • 2306.04746 • Published Jun 7, 2023
Why do LLaVA Vision-Language Models Reply to Images in English? Paper • 2407.02333 • Published Jul 2, 2024
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model Paper • 2404.01331 • Published Mar 29, 2024
TEQ: Trainable Equivalent Transformation for Quantization of LLMs Paper • 2310.10944 • Published Oct 17, 2023
Efficient Post-training Quantization with FP8 Formats Paper • 2309.14592 • Published Sep 26, 2023
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Paper • 2306.16601 • Published Jun 28, 2023
Prune Once for All: Sparse Pre-Trained Language Models Paper • 2111.05754 • Published Nov 10, 2021