API Pack: A Massive Multilingual Dataset for API Call Generation Paper • 2402.09615 • Published Feb 14, 2024
Granite Code Models: A Family of Open Foundation Models for Code Intelligence Paper • 2405.04324 • Published May 7, 2024 • 22
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks Paper • 2407.00121 • Published Jun 27, 2024
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler Paper • 2408.13359 • Published Aug 23, 2024 • 24
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published Jun 26, 2024 • 48
JetMoE: Reaching Llama2 Performance with 0.1M Dollars Paper • 2404.07413 • Published Apr 11, 2024 • 37
Diversity Measurement and Subset Selection for Instruction Tuning Datasets Paper • 2402.02318 • Published Feb 4, 2024 • 2
Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation Paper • 2305.07804 • Published May 12, 2023 • 2