COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Paper • 2410.19313 • Published Oct 25 • 19 • 5
On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published Oct 30 • 18
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 28 days ago • 444
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7 • 16