Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 19 days ago • 65
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection Paper • 2408.04284 • Published Aug 8, 2024 • 26
Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates Paper • 2101.08133 • Published Jan 20, 2021
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5, 2024 • 31
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 49
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 49
Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian Paper • 2405.13929 • Published May 22, 2024 • 54
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification Paper • 2403.04696 • Published Mar 7, 2024 • 4
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection Paper • 2305.14902 • Published May 24, 2023 • 1