Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2, 2024 • 14
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 146
QTSumm: A New Benchmark for Query-Focused Table Summarization Paper • 2305.14303 • Published May 23, 2023
Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model Paper • 2209.11477 • Published Sep 23, 2022
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples Paper • 2210.12374 • Published Oct 22, 2022
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 70
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 10
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17, 2024 • 15
SUPERB: Speech processing Universal PERformance Benchmark Paper • 2105.01051 • Published May 3, 2021 • 1
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Paper • 2203.04911 • Published Mar 9, 2022
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings Paper • 2204.10298 • Published Apr 21, 2022 • 1
DINOv2: Learning Robust Visual Features without Supervision Paper • 2304.07193 • Published Apr 14, 2023 • 6
Exploring Efficient-tuning Methods in Self-supervised Speech Models Paper • 2210.06175 • Published Oct 10, 2022
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning Paper • 2309.02591 • Published Sep 5, 2023 • 15