Jens van Holland's picture

1 8 34

Jens van Holland

jvh

jvhgit

AI & ML interests

Deep Learning, NLP, applications and Data Science

Organizations

None yet

jvh's activity

upvoted a collection 7 months ago

GLM-4

GLM-4 Open Models • 13 items • Updated 30 days ago • 115

upvoted a paper 7 months ago

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25 • 47

upvoted an article 8 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 228

upvoted 3 papers 9 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 604

Model Stock: All we need is just a few fine-tuned models

Paper • 2403.19522 • Published Mar 28 • 10

upvoted a collection 9 months ago

INT4/8 Quantized Whisper CT2

Int4/8 Quantized Whisper Models by using the quanto package and the CTranslate2 package. Requires (much) less GPU resources while keeping performance. • 4 items • Updated Mar 19 • 2

upvoted a paper 10 months ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 77