See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Recent Activity
updated
a model
5 minutes ago
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-m-iter-1_sample_2500_nsk_ml512_mlr5e-5
updated
a dataset
14 minutes ago
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-iter1_sample_2500_nsk_ml512
updated
a model
16 minutes ago
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-e-iter-1_sample_2500_nsk_ml512_mlr5e-5
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 77 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 26 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 32 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 15
models
218
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-m-iter-1_sample_2500_nsk_ml512_mlr5e-5
Updated
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-e-iter-1_sample_2500_nsk_ml512_mlr5e-5
Updated
ZhangShenao/math_math-gemma-2-9b-it-m-iter-1_sample_2500_nsk_ml512
Updated
ZhangShenao/math_math-gemma-2-9b-it-e-iter-1_sample_2500_nsk_ml512
Updated
ZhangShenao/math_math-gemma-1.1-7b-it-m-iter-3_sample_2500_nsk_ml512
Updated
ZhangShenao/math_math-gemma-1.1-7b-it-e-iter-3_sample_2500_nsk_ml512
Updated
ZhangShenao/math_gsm-gemma-1.1-7b-it-m-iter-3_sample_2500_nsk_ml512
Updated
ZhangShenao/math_gsm-gemma-1.1-7b-it-e-iter-3_sample_2500_nsk_ml512
Updated
ZhangShenao/math_math-gemma-1.1-7b-it-m-iter-2_sample_2500_nsk_ml512
Updated
•
2
ZhangShenao/math_gsm-gemma-2-9b-it-m-iter-1_sample_2500_nsk_ml512
Updated
datasets
134
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-iter1_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.5k
•
3
ZhangShenao/math_math-gemma-2-9b-it-iter1_sample_2500_nsk_ml512
Updated
ZhangShenao/math_math-gemma-1.1-7b-it-iter3_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.5k
ZhangShenao/math_gsm-gemma-1.1-7b-it-iter3_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.47k
ZhangShenao/math_math-gemma-1.1-7b-it-iter2_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.5k
•
1
ZhangShenao/math_gsm-gemma-2-9b-it-iter1_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.5k
•
1
ZhangShenao/math_gsm-gemma-1.1-7b-it-iter2_sample_2500_nsk_ml512
Viewer
•
Updated
•
2.5k
•
1
ZhangShenao/math_math-gemma-2-9b-it-iter3_sample_2500_nsk_ml512_self
Viewer
•
Updated
•
7.5k
•
1
ZhangShenao/math_math-gemma-1.1-7b-it-iter3_sample_2500_nsk_ml512_self
Viewer
•
Updated
•
7.5k
•
1
ZhangShenao/math_gsm-gemma-2-9b-it-iter3_sample_2500_nsk_ml512_self
Viewer
•
Updated
•
7.47k
•
1