Dayan Ruben's picture

83 200

Dayan Ruben

dayanruben

·

https://dayanruben.com

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 hour ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

liked a model about 1 hour ago

meta-llama/Llama-4-Maverick-17B-128E-Original

liked a model about 1 hour ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct-Original

View all activity

Organizations

dayanruben's activity

upvoted an article about 1 hour ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

about 21 hours ago

• 20

upvoted a collection about 1 hour ago

Llama 4

Llama 4 release • 10 items • Updated about 1 hour ago • 105

upvoted a collection 2 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 2 days ago • 88

upvoted an article 3 days ago

Article

Open R1: How to use OlympicCoder locally for coding?

17 days ago

• 56

upvoted 2 collections 10 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 10 days ago • 77

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 2 days ago • 39

upvoted a collection 14 days ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated 16 days ago • 90

upvoted a collection 24 days ago

Gemma 3 Release

17 items • Updated 2 days ago • 310

upvoted 2 articles about 2 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 955

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted 3 collections 2 months ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 2 days ago • 85

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 5 days ago • 436

upvoted 2 articles 2 months ago

Article

Hugging Face + PyCharm

Nov 5, 2024

• 28

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 456

upvoted 2 collections 2 months ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 6 days ago • 215

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Feb 26 • 111