LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper β’ 2501.03895 β’ Published 6 days ago β’ 44
view article Article Deploying Your FastAPI Applications on Huggingface Via Docker By HemanthSai7 β’ Dec 11, 2023 β’ 19
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper β’ 2411.12240 β’ Published Nov 19, 2024 β’ 6
view article Article MedEmbed: Fine-Tuned Embedding Models for Medical / ClinicalΒ IR By abhinand β’ Oct 20, 2024 β’ 35
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 β’ 216
view article Article How to build a custom text classifier without days of human labeling By sdiazlor β’ Oct 17, 2024 β’ 55
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19, 2024 β’ 129
view article Article π€ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 β’ 45
view article Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML β’ Oct 2, 2024 β’ 43
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 186
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper β’ 2409.17146 β’ Published Sep 25, 2024 β’ 106
view article Article Llama can now see and run on your device - welcome Llama 3.2 Sep 25, 2024 β’ 180
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper β’ 2409.01704 β’ Published Sep 3, 2024 β’ 83
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x β’ Jun 23, 2024 β’ 34
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x β’ Jun 23, 2024 β’ 69
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 β’ Aug 19, 2024 β’ 75