Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 136
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 550
Article SmolVLM Grows Smaller – Introducing the 250M & 500M Models! 13 days ago • 109
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 20 days ago • 63
Multi-task retriever fine-tuning for domain-specific and efficient RAG Paper • 2501.04652 • Published 27 days ago • 10
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 25 days ago • 43
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 25 days ago • 60
YuLan-Mini: An Open Data-efficient Language Model Paper • 2412.17743 • Published Dec 23, 2024 • 64
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Paper • 2412.15797 • Published Dec 20, 2024 • 17
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 13
VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation Paper • 2412.10704 • Published Dec 14, 2024 • 15
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 44
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11, 2024 • 12
Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 • 49