2 16

cheng wen

StudentWen

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

O1-OPEN/OpenO1-SFT

reacted to WizardLM's post with 👍 7 months ago

🔥🔥🔥 Introducing WizardLM-2! 📙Release Blog: https://wizardlm.github.io/WizardLM2 ✅Model Weights: https://huggingface.co/collections/microsoft/wizardlm-661d403f71e6c8257dbd598a 🐦Twitter: https://twitter.com/WizardLM_AI/status/1779899325868589372 We introduce and opensource WizardLM-2, our next generation state-of-the-art large language models, which have improved performance on complex chat, multilingual, reasoning and agent. New family includes three cutting-edge models: WizardLM-2 8x22B, WizardLM-2 70B, and WizardLM-2 7B. WizardLM-2 8x22B is our most advanced model, and the best opensource LLM in our internal evaluation on highly complex tasks. WizardLM-2 70B reaches top-tier reasoning capabilities and is the first choice in the same size. WizardLM-2 7B is the fastest and achieves comparable performance with existing 10x larger opensource leading models. 🤗 WizardLM 2 Capacities: 1. MT-Bench (Figure-1) The WizardLM-2 8x22B even demonstrates highly competitive performance compared to the most advanced proprietary works such as GPT-4-Trubo and Glaude-3. Meanwhile, WizardLM-2 7B and WizardLM-2 70B are all the top-performing models among the other leading baselines at 7B to 70B model scales. 2. Human Preferences Evaluation (Figure 2) Through this human preferences evaluation, WizardLM-2's capabilities are very close to the cutting-edge proprietary models such as GPT-4-1106-preview, and significantly ahead of all the other open source models. 🔍Method Overview: As the natural world's human-generated data becomes increasingly exhausted through LLM training, we believe that: the data carefully created by AI and the model step-by-step supervised by AI will be the sole path towards more powerful AI. In the past one year, we built a fully AI powered synthetic training system. (As shown in the Figure 3).

liked a dataset 7 months ago

OpenGVLab/ShareGPT-4o

View all activity

Organizations

StudentWen's activity

liked a dataset about 1 month ago

O1-OPEN/OpenO1-SFT

Viewer • Updated 28 days ago • 77.7k • 2.09k • 324

liked a dataset 7 months ago

OpenGVLab/ShareGPT-4o

Viewer • Updated Aug 17, 2024 • 59.4k • 16.7k • 153

liked a dataset 10 months ago

FreedomIntelligence/ALLaVA-4V

Viewer • Updated Sep 2, 2024 • 143k • 338 • 76

liked a model 10 months ago

BELLE-2/Belle-whisper-large-v3-zh

Automatic Speech Recognition • Updated 29 days ago • 435 • 100

liked a dataset 10 months ago

m-a-p/Code-Feedback

Viewer • Updated Feb 26, 2024 • 66.4k • 159 • 204

liked a dataset 12 months ago

BelleGroup/generated_chat_0.4M

Viewer • Updated Apr 8, 2023 • 396k • 102 • 65

liked a dataset about 1 year ago

BELLE-2/train_3.5M_CN_With_Category

Preview • Updated Oct 18, 2023 • 81 • 22

liked a model about 1 year ago

BELLE-2/Belle-distilwhisper-large-v2-zh

Automatic Speech Recognition • Updated Jun 21, 2024 • 107 • 36

liked a dataset about 1 year ago

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 1.96k • 594

liked 3 models about 1 year ago

liked 2 models over 1 year ago

BELLE-2/BELLE-Llama2-13B-chat-0.4M

Text2Text Generation • Updated Oct 18, 2023 • 576 • 31

THUDM/chatglm2-6b-32k

Updated Aug 4, 2024 • 222 • 294

liked 2 models about 2 years ago

IDEA-CCNL/Taiyi-Stable-Diffusion-1B-Chinese-v0.1

Text-to-Image • Updated May 25, 2023 • 385 • 433

google-bert/bert-base-chinese

Fill-Mask • Updated Feb 19, 2024 • 1.73M • 1.07k