MoDeGPT: Modular Decomposition for Large Language Model Compression • Paper • 2408.09632 • Published Aug 19, 2024
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition • Paper • 2403.19822 • Published Mar 28, 2024
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 • Text Generation • Updated May 13, 2024 • 6
GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3 • Text Generation • Updated May 12, 2024 • 4
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2 • Text Generation • Updated May 12, 2024 • 103
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1 • Text Generation • Updated May 12, 2024 • 40
ToolQA: A Dataset for LLM Question Answering with External Tools • Paper • 2306.13304 • Published Jun 23, 2023
PEEKABOO: Interactive Video Generation via Masked-Diffusion • Paper • 2312.07509 • Published Dec 12, 2023 • 8
ColloSSL: Collaborative Self-Supervised Learning for Human Activity Recognition • Paper • 2202.00758 • Published Feb 1, 2022
AdaPlanner: Adaptive Planning from Feedback with Language Models • Paper • 2305.16653 • Published May 26, 2023