arxiv:2408.08441
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Organizations
models
None public yet
datasets
191
Asap7772/generative_rm_prompting_datapersona_gpt_modelgpt-4o-mini_num200
Viewer
•
Updated
•
400
•
5
Asap7772/generative_rm_prompting_datahh_harmless_modelgpt-4o-mini_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datahh_helpful_modelgpt-4o-mini_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datapersona_gpt_modelgpt-4o_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datapersona_gpt_modelgpt-3.5-turbo_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datahh_helpful_modelgpt-4o_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datahh_harmless_modelgpt-3.5-turbo_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datahh_helpful_modelgpt-3.5-turbo_num200
Viewer
•
Updated
•
400
•
3
Asap7772/generative_rm_prompting_datahh_harmless_modelgpt-4o_num200
Viewer
•
Updated
•
400
•
3
Asap7772/prm800k_onpolicy_multiturn_relabeled
Viewer
•
Updated
•
20k