Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper ā¢ 2502.06703 ā¢ Published Feb 10 ā¢ 142
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 ā¢ 47
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 ā¢ 71
Agentless: Demystifying LLM-based Software Engineering Agents Paper ā¢ 2407.01489 ā¢ Published Jul 1, 2024 ā¢ 61
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb ā¢ Nov 28, 2024 ā¢ 138
view article Article Unlocking Longer Generation with Key-Value Cache Quantization May 16, 2024 ā¢ 45
LLaVa-NeXT Collection LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. ā¢ 8 items ā¢ Updated Jul 19, 2024 ā¢ 29
LLaVA-1.6 Collection A collection of LLaVA-1.6 checkpoints ā¢ 4 items ā¢ Updated Jan 31, 2024 ā¢ 69
LLM papers Collection It is a collection of papers that are useful in studying LLM. ā¢ 14 items ā¢ Updated Apr 3, 2024 ā¢ 12