Daniel Bourke's picture

Daniel Bourke PRO

mrdbourke

·

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a model 4 days ago

tencent/Hunyuan3D-2

liked a Space 4 days ago

Qwen/Qwen2.5-Max-Demo

upvoted a collection 4 days ago

View all activity

Organizations

None yet

mrdbourke's activity

upvoted 2 collections 4 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 8 days ago • 96

Mistral Small

5 items • Updated 4 days ago • 4

upvoted 2 articles 4 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

20 days ago

• 132

Article

Timm ❤️ Transformers: Use any timm model with transformers

19 days ago

• 37

upvoted an article 5 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 546

upvoted 2 collections 5 days ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024 • 32

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311

upvoted an article 6 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

7 days ago

• 587

upvoted a collection about 2 months ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 24 days ago • 81

upvoted 2 papers 2 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43

upvoted an article 3 months ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

Oct 27, 2024

• 38

upvoted 2 collections 3 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 211

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Dec 18, 2024 • 95

upvoted a paper 3 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126

upvoted a collection 3 months ago

Stable Diffusion 3.5

6 items • Updated 25 days ago • 128

upvoted 2 articles 4 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 85

Article

Let's talk about LLM evaluation

By

•

May 23, 2024

• 148

upvoted 2 collections 4 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Dec 13, 2024 • 144

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Dec 13, 2024 • 50