Sachith Gunasekara

sachithgunasekara

AI & ML interests

Large Language Models (LLMs), Deep Learning, AI Safety/Privacy

Recent Activity

updated a dataset about 1 month ago

sachithgunasekara/phased-self-discover-mistral-unstructured-0-shot-bbh-eval

published a dataset about 1 month ago

sachithgunasekara/phased-self-discover-mistral-unstructured-0-shot-bbh-eval

updated a dataset about 1 month ago

sachithgunasekara/phased-self-discover-mistral-structured-5-shot-bbh-eval

sachithgunasekara's activity

upvoted an article about 1 month ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

• 192

upvoted 2 articles 7 months ago

Article

Introducing the SQL Console on Datasets

Sep 17, 2024

• 23

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 188

upvoted 2 papers 7 months ago

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Paper • 2310.03051 • Published Oct 4, 2023 • 35

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

upvoted an article 10 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 570

upvoted an article 11 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 140

upvoted a collection about 1 year ago

Open-Bezoar

Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data • 7 items • Updated Apr 19, 2024 • 6