LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Paper • 2408.14307 • Published Aug 26, 2024 • 4
DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification Paper • 2408.14236 • Published Aug 26, 2024 • 5
Project SHADOW: Symbolic Higher-order Associative Deductive reasoning On Wikidata using LM probing Paper • 2408.14849 • Published Aug 27, 2024 • 5
Platypus: A Generalized Specialist Model for Reading Text in Various Forms Paper • 2408.14805 • Published Aug 27, 2024 • 14
Text2SQL is Not Enough: Unifying AI and Databases with TAG Paper • 2408.14717 • Published Aug 27, 2024 • 26
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper • 2408.15237 • Published Aug 27, 2024 • 39
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published Aug 27, 2024 • 139
Multi-task retriever fine-tuning for domain-specific and efficient RAG Paper • 2501.04652 • Published 4 days ago • 8
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 4 days ago • 9
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Paper • 2501.03271 • Published 8 days ago • 9
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper • 2501.04144 • Published 5 days ago • 14
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper • 2501.04689 • Published 4 days ago • 14
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper • 2501.04575 • Published 4 days ago • 21
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published 5 days ago • 29
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Paper • 2501.04686 • Published 4 days ago • 44
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 66
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 4 days ago • 72