Offline Reinforcement Learning for LLM Multi-Step Reasoning • Paper • 2412.16145 • Published 4 days ago • 25
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean • Paper • 2403.10882 • Published Mar 16 • 5
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment • Paper • 2403.11399 • Published Mar 18 • 6
BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining • Paper • 2401.06443 • Published Jan 12 • 2
Meta Llama 3 • Collection • This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 18 days ago • 696
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning • Paper • 2402.15506 • Published Feb 23 • 14
In deep reinforcement learning, a pruned network is a good network • Paper • 2402.12479 • Published Feb 19 • 18