RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval Paper • 2409.10516 • Published Sep 16 • 34
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse Paper • 2409.11242 • Published Sep 17 • 5
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Paper • 2409.11136 • Published Sep 17 • 21
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper • 2410.02749 • Published 17 days ago • 12
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published 18 days ago • 10
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations Paper • 2410.02762 • Published 17 days ago • 9
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models Paper • 2410.01335 • Published 19 days ago • 5
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published 19 days ago • 34
Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published 19 days ago • 12
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs Paper • 2410.01518 • Published 19 days ago • 2
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published 21 days ago • 53
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published 18 days ago • 44
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published 19 days ago • 136
Mentor-KD: Making Small Language Models Better Multi-step Reasoners Paper • 2410.09037 • Published 9 days ago • 4
Rethinking Data Selection at Scale: Random Selection is Almost All You Need Paper • 2410.09335 • Published 9 days ago • 13
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper • 2410.08815 • Published 10 days ago • 34
SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights Paper • 2410.09008 • Published 10 days ago • 16
Mechanistic Permutability: Match Features Across Layers Paper • 2410.07656 • Published 11 days ago • 16
SimpleStrat: Diversifying Language Model Generation with Stratification Paper • 2410.09038 • Published 9 days ago • 4
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness Paper • 2410.07035 • Published 12 days ago • 16
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs Paper • 2410.12405 • Published 5 days ago • 13
Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published 5 days ago • 19
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Paper • 2410.10814 • Published 6 days ago • 44
Vector-ICL: In-context Learning with Continuous Vector Representations Paper • 2410.05629 • Published 13 days ago • 3
Intriguing Properties of Large Language and Vision Models Paper • 2410.04751 • Published 14 days ago • 16