Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Paper • 2405.15071 • Published May 23 • 37
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs Paper • 2311.05657 • Published Nov 9, 2023 • 27
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13 • 49
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification Paper • 2403.04696 • Published Mar 7 • 4
Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge Paper • 2403.01432 • Published Mar 3 • 2
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering Paper • 2401.08500 • Published Jan 16 • 5
Large language models surpass human experts in predicting neuroscience results Paper • 2403.03230 • Published Mar 4 • 4
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4 • 5
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21 • 112
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16 • 77
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15 • 27