Thomas Anderson's picture

11 56

Thomas Anderson

farpluto

·

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

farpluto/SmolLM2-135M-Instruct-Q4_K_M-GGUF

published a model 1 day ago

farpluto/SmolLM2-135M-Instruct-Q4_K_M-GGUF

liked a dataset 1 day ago

LTCB/enwik8

View all activity

Organizations

None yet

farpluto's activity

upvoted a collection 17 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 17 days ago • 339

upvoted a collection 4 months ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Dec 18, 2024 • 96

upvoted 2 collections 6 months ago

TriLMs-Unpacked

TriLMs unpacked to FP16 - compatible with any implementation supporting LLaMa architecture in huggingface's transformers format. • 9 items • Updated Jul 9, 2024 • 4

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 123

upvoted an article 6 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 27

upvoted 3 collections 6 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 357

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 648

InternVL2.0

Expanding Performance Boundaries of Open-Source MLLM • 15 items • Updated Jan 10 • 91

upvoted a collection 7 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 214

upvoted a collection 8 months ago

Core ML Gallery Models

7 items • Updated Oct 4, 2024 • 34

upvoted a collection 9 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 554