ZhangShenao/bt-math_gsm-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_1 Viewer • Updated 8 days ago • 3.74k • 78
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency Paper • 2309.17382 • Published Sep 29, 2023 • 5
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_qk_ep_10 Updated 13 days ago • 11
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_qk_ep_10 Updated 13 days ago • 11
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_noneep_10 Updated 13 days ago • 10
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_vo_ep_10 Updated 14 days ago • 10
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_noneep_10 Updated 13 days ago • 10
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_vo_ep_10 Updated 14 days ago • 10
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_reverse_vo Updated 14 days ago • 12
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_reverse_vo Updated 14 days ago • 12
ZhangShenao/math_gsm-gemma-1.1-7b-it-msft-sample_7473_tp_unfreeze_reverse_vo Updated 14 days ago • 16
ZhangShenao/math_gsm-gemma-1.1-7b-it-msft-sample_7473_tp_unfreeze_reverse_vo Updated 14 days ago • 16
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_qk Updated 14 days ago • 9
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_qk Updated 14 days ago • 9
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-msft-sample_7473_tp_unfreeze_vo Updated 14 days ago • 10