WeihaoZeng

AndrewZeng

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago
AndrewZeng/math-bstar-sample
updated a dataset 1 day ago
AndrewZeng/bstar-math-dev
updated a dataset 1 day ago
AndrewZeng/prm-reward-data
View all activity

Organizations

HKUST NLP Group's profile picture

AndrewZeng's activity

reacted to WizardLM's post with ๐Ÿ‘๐Ÿš€ 5 months ago
view post
Post
10512
๐Ÿ”ฅ ๐Ÿ”ฅ๐Ÿ”ฅ
Excited to announce WizardLM new Paper: Auto Evol-Instruct!

๐Ÿฆ Twitter: https://x.com/WizardLM_AI/status/1812857977122202087

๐Ÿ“ƒ Paper: https://arxiv.org/pdf/2406.00770

๐Ÿค– 1. Fully AI-Powered Pipeline

Auto Evol-Instruct automatically involves an iterative process of optimizing an Evol-Instruct V1 into an optimal one. The pipeline consists of two critical stages: Evol Trajectory Analysis, where the optimizer LLM analyzes the issues and failures exposed in instruction evolution performed by the evol LLM, and Evolving Method Optimization, where the optimizer LLM addresses these issues to progressively develop an effective evolving method. The optimal evolving method is then used to convert the entire instruction dataset into more diverse and complex forms, facilitating improved instruction tuning.

๐Ÿ“ˆ2. Scaling Evol-Instruct with Arena Learning

With Auto Evol-Instruct, the evolutionary synthesis data of WizardLM-2 has scaled up from WizardLM-1 to dozens of domains, covering tasks in all aspects of large language models. This allows Arena Learning to train and learn from an almost infinite pool of high-difficulty instruction data, fully unlocking all the potential of Arena Learning.
  • 1 reply
ยท
upvoted an article 6 months ago
view article
Article

Welcome Gemma 2 - Google's new open LLM

โ€ข 124