Daniel Dahlmeier's picture

1 3

Daniel Dahlmeier

ddahlmeier

ddahlmeier

AI & ML interests

NLP

Recent Activity

updated a model 13 days ago

ddahlmeier/open_llama_3b_v2_chat_dolly

updated a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa

published a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa

View all activity

Organizations

None yet

ddahlmeier's activity

updated a model 13 days ago

ddahlmeier/open_llama_3b_v2_chat_dolly

Text Generation • Updated 13 days ago • 11

updated a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa

Text Generation • Updated 15 days ago • 2

published a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa

Text Generation • Updated 15 days ago • 2

updated a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa-lora

Text Generation • Updated 15 days ago • 19

published a model 15 days ago

ddahlmeier/llama-3.2-1B-sutdqa-lora

Text Generation • Updated 15 days ago • 19

updated a model 15 days ago

ddahlmeier/llama-3.1-1B-aws

Text Generation • Updated 15 days ago • 36

updated a dataset 15 days ago

ddahlmeier/sutd_qa_dataset

Viewer • Updated 15 days ago • 200 • 37

updated a model 22 days ago

ddahlmeier/sutd-llama3-qa

Text Generation • Updated 22 days ago • 12

published a model 22 days ago

ddahlmeier/sutd-llama3-qa

Text Generation • Updated 22 days ago • 12

published a model about 1 month ago

ddahlmeier/llama-3.1-1B-aws

Text Generation • Updated 15 days ago • 36

updated a model about 1 month ago

ddahlmeier/llama-3.1-1B

Text Generation • Updated Mar 9 • 2

published a model about 1 month ago

ddahlmeier/llama-3.1-1B

Text Generation • Updated Mar 9 • 2

reacted to philschmid's post with 👍 7 months ago

Post

What's the best way to fine-tune open LLMs in 2024? Look no further! 👀 I am excited to share “How to Fine-Tune LLMs in 2024 with Hugging Face” using the latest research techniques, including Flash Attention, Q-LoRA, OpenAI dataset formats (messages), ChatML, Packing, all built with Hugging Face TRL. 🚀

It is created for consumer-size GPUs (24GB) covering the full end-to-end lifecycle with:
💡Define and understand use cases for fine-tuning
🧑🏻‍💻 Setup of the development environment
🧮 Create and prepare dataset (OpenAI format)
🏋️‍♀️ Fine-tune LLM using TRL and the SFTTrainer
🥇 Test and evaluate the LLM
🚀 Deploy for production with TGI

👉 https://www.philschmid.de/fine-tune-llms-in-2024-with-trl

Coming soon: Advanced Guides for multi-GPU/multi-Node full fine-tuning and alignment using DPO & KTO. 🔜

4 replies

·

upvoted a paper 7 months ago

LMDX: Language Model-based Document Information Extraction and Localization

Paper • 2309.10952 • Published Sep 19, 2023 • 65

upvoted a paper 8 months ago

EXAONE 3.0 7.8B Instruction Tuned Language Model

Paper • 2408.03541 • Published Aug 7, 2024 • 36

updated a model 12 months ago

ddahlmeier/llama-7b-qlora-sutd-qa

Text Generation • Updated Apr 25, 2024

updated 2 models about 1 year ago

ddahlmeier/llama-7b-qlora-ultrachat

Updated Mar 31, 2024

ddahlmeier/llama-7b-sutd-qa

Updated Mar 30, 2024

updated a dataset about 1 year ago

ddahlmeier/sutd_instruct

Viewer • Updated Feb 17, 2024 • 30 • 22