David Golchinfar's picture

David Golchinfar PRO

DavidGF

·

https://vago-solutions.ai

AI & ML interests

finetune llms, improve german language understanding and generated text of llms

Recent Activity

liked a dataset about 8 hours ago

open-thoughts/OpenThoughts2-1M

liked a Space 1 day ago

yourbench/demo

updated a model 1 day ago

DavidGF/SauerkrautTTS-Preview-0.1-Q8_0-GGUF

View all activity

Organizations

DavidGF's activity

upvoted an article 1 day ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

9 days ago

• 93

upvoted an article 3 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 169

upvoted a collection 5 months ago

🇫🇷 Calme-3

Here you can find all the new Calme-3 models • 27 items • Updated Feb 9 • 15

upvoted a paper 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13

upvoted an article 8 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31, 2024

• 58

upvoted a collection 9 months ago

VAGO solutions quants

Quantized version for the excellent german speaking models created by VAGO solutions. • 6 items • Updated Apr 20, 2024 • 2

upvoted a collection 10 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 361

upvoted a collection 11 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 37

upvoted 2 collections 12 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 733

🇩🇪German SFT and DPO datasets

Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23 • 11

upvoted 3 papers about 1 year ago

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 612

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Paper • 2401.17377 • Published Jan 30, 2024 • 36