Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper โข 2502.06703 โข Published Feb 10 โข 142
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 โข 47
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 โข 71
Agentless: Demystifying LLM-based Software Engineering Agents Paper โข 2407.01489 โข Published Jul 1, 2024 โข 61
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb โข Nov 28, 2024 โข 138
view article Article Unlocking Longer Generation with Key-Value Cache Quantization May 16, 2024 โข 44
LLaVa-NeXT Collection LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. โข 8 items โข Updated Jul 19, 2024 โข 29
LLaVA-1.6 Collection A collection of LLaVA-1.6 checkpoints โข 4 items โข Updated Jan 31, 2024 โข 69
LLM papers Collection It is a collection of papers that are useful in studying LLM. โข 14 items โข Updated Apr 3, 2024 โข 12