oceansweep's Collections
Relevant-Papers-Midterm
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 18
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 58
CRAG -- Comprehensive RAG Benchmark
Paper • 2406.04744 • Published • 45
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 44
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Paper • 2406.09403 • Published • 20
Interpreting the Weight Space of Customized Diffusion Models
Paper • 2406.09413 • Published • 19
OpenVLA: An Open-Source Vision-Language-Action Model
Paper • 2406.09246 • Published • 37
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Paper • 2406.09416 • Published • 28
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper • 2406.09415 • Published • 51
Paper • 2406.09414 • Published • 97
Large Language Model Confidence Estimation via Black-Box Access
Paper • 2406.04370 • Published • 21
DataComp-LM: In search of the next generation of training sets for language models
Paper • 2406.11794 • Published • 50
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 89
Breaking the Attention Bottleneck
Paper • 2406.10906 • Published • 4
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Paper • 2406.11230 • Published • 34
google/xtr-base-multilingual
Updated • 71 • 9
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper • 2406.15319 • Published • 64
Agentless: Demystifying LLM-based Software Engineering Agents
Paper • 2407.01489 • Published • 59
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper • 2407.01370 • Published • 86
LiteSearch: Efficacious Tree Search for LLM
Paper • 2407.00320 • Published • 38
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Paper • 2407.07071 • Published • 12
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper • 2407.03502 • Published • 51
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper • 2407.14057 • Published • 45
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Paper • 2406.10149 • Published • 49
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper • 2408.08152 • Published • 54
Why Does the Effective Context Length of LLMs Fall Short?
Paper • 2410.18745 • Published • 17
arcee-ai/SuperNova-Medius-GGUF
Updated • 1.98k • 59