Seungwoo Ryu's picture

Seungwoo Ryu PRO

tryumanshow

·

AI & ML interests

LLM, Agent

Recent Activity

liked a model 3 days ago

deepseek-ai/DeepSeek-R1

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

liked a dataset 5 days ago

NovaSky-AI/Sky-T1_data_17k

View all activity

Organizations

tryumanshow's activity

upvoted an article 5 days ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 48

upvoted a collection 2 months ago

Korean Instruction Dataset

5 items • Updated about 20 hours ago • 5

upvoted a collection 3 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 104

upvoted a collection 4 months ago

Korean Reward Modeling

Korean Datasets, Reward Models for RLHF • 16 items • Updated Nov 19, 2024 • 3

upvoted a paper 4 months ago

DiaSynth -- Synthetic Dialogue Generation Framework

Paper • 2409.19020 • Published Sep 25, 2024 • 20

upvoted an article 5 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

upvoted a collection 5 months ago

LLMs

376 items • Updated about 23 hours ago • 26

upvoted 2 papers 7 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 30

upvoted 3 collections 8 months ago

Function Calling v3

Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 20

Agents

Collection of resources related to Agents. • 71 items • Updated 9 days ago • 5

Miqu-based Models

A collection of creative writing models based on the 'miqu-1-70b ' model. • 9 items • Updated Dec 3, 2024 • 2

upvoted a collection 9 months ago

Agents

63 items • Updated 14 days ago • 5

upvoted a paper 9 months ago

Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Paper • 2405.00664 • Published May 1, 2024 • 20

upvoted an article 9 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 264

upvoted a collection 9 months ago

Long context

94 items • Updated Sep 29, 2024 • 30

upvoted an article 9 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 233

upvoted a collection 9 months ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24