suchen

suc16

AI & ML interests

LLM

Recent Activity

liked a model 1 day ago

moonshotai/Moonlight-16B-A3B

upvoted an article 17 days ago

Proximal Policy Optimization (PPO)

upvoted a paper about 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Organizations

None yet

suc16's activity

upvoted an article 17 days ago

Proximal Policy Optimization (PPO)

Article • Aug 5, 2022 • 23

upvoted a paper about 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

upvoted a collection about 2 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 262

upvoted a collection 8 months ago

BGE

23 items • Updated 11 days ago • 90

upvoted 5 papers over 1 year ago

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Paper • 2308.07926 • Published Aug 15, 2023 • 28

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

Paper • 2308.01320 • Published Aug 2, 2023 • 45

Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 48

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244

Secrets of RLHF in Large Language Models Part I: PPO

Paper • 2307.04964 • Published Jul 11, 2023 • 29