nguyenphuthien (Thien Phu Nguyen)

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 263

upvoted a paper 5 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

upvoted an article 5 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Aug 26, 2024 • 51

upvoted a paper 8 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 56

upvoted 2 papers 9 months ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 60

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 63

upvoted an article 9 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024 • 130

upvoted 3 papers 10 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 62

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

upvoted a collection 10 months ago

Handbook v0.1 models and datasets

Collection

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24

upvoted a collection 11 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 129

upvoted a paper about 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 610

upvoted a collection about 1 year ago

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated 20 days ago • 33

upvoted 2 papers about 1 year ago

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Paper • 2401.04081 • Published Jan 8, 2024 • 71

Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM

Paper • 2401.02994 • Published Jan 4, 2024 • 49

upvoted a paper over 1 year ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 44
