2 3 14

Wenda Xu

xu1998hz

AI & ML interests

LLM alignment, Text generation evaluation metrics

Recent Activity

liked a model 11 days ago

xu1998hz/sescore2_en_pretrained

updated a model 11 days ago

xu1998hz/sescore2_en_pretrained

upvoted a paper 4 months ago

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

View all activity

Organizations

None yet

xu1998hz's activity

liked a model 11 days ago

xu1998hz/sescore2_en_pretrained

Updated 11 days ago • 1

updated a model 11 days ago

xu1998hz/sescore2_en_pretrained

Updated 11 days ago • 1

upvoted a paper 4 months ago

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Paper • 2410.11325 • Published Oct 15, 2024 • 1

updated a model 6 months ago

xu1998hz/supervised_kd_math

Text Generation • Updated Sep 3, 2024 • 9

updated a model 7 months ago

xu1998hz/gemma-code-2b

Text Generation • Updated Aug 29, 2024 • 8

liked a model 7 months ago

xu1998hz/gemma-gsm8k-2b

Text Generation • Updated Jul 31, 2024 • 10 • 1

updated 2 models 8 months ago

xu1998hz/gemma-gsm8k-2b

Text Generation • Updated Jul 31, 2024 • 10 • 1

xu1998hz/gemma-gsm8k-7b

Text Generation • Updated Jul 31, 2024 • 12

updated 3 models 9 months ago

upvoted a paper 9 months ago

BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM

Paper • 2406.12168 • Published Jun 18, 2024 • 7

liked a model 10 months ago

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 14.1k • • 214

updated 7 models 11 months ago

xu1998hz/7_sft_lora_256

Updated May 2, 2024

xu1998hz/6_sft_lora_256

Updated May 2, 2024

xu1998hz/5_sft_lora_256

Updated May 2, 2024

xu1998hz/4_sft_lora_256

Updated May 2, 2024

xu1998hz/3_sft_lora_256

Updated May 2, 2024

xu1998hz/2_sft_lora_256

Updated May 2, 2024

xu1998hz/1_sft_lora_256

Updated May 2, 2024