DarshanDeshpande/distilbert_social_reasoning_reward_model Text Classification • Updated Mar 10, 2024 • 12
Holarissun/RM-HH-Gemma_harmless_gpt3_20000_gemma2b_shuffleFalse_extractchosenTrue Updated Apr 19, 2024 • 1
Holarissun/RM-HH-Gemma_harmless_gpt3_20000_gemma2b_shuffleFalse_extractchosenFalse Updated Apr 19, 2024 • 1
Holarissun/RM-HH-Gemma_harmless_gpt3_20000_gemma2b_shuffleTrue_extractchosenTrue Updated Apr 19, 2024 • 1
Holarissun/RM-HH-Mix_harmless_gpt3_20000_gemma2b_shuffleFalse_extractchosenFalse Updated Apr 19, 2024
Holarissun/RM-HH-AllMixNonPeft_harmless_gpt3_20000_gpt2-large_shuffleFalse_extractchosenTrue Text Classification • Updated Apr 22, 2024 • 4
Holarissun/RM-HH-AllMixNonPeft_harmless_gpt3_20000_gpt2-large_shuffleTrue_extractchosenFalse Text Classification • Updated Apr 22, 2024 • 4
Holarissun/RM-HH-AllMixNonPeft_harmless_gpt3_20000_gpt2-large_shuffleFalse_extractchosenFalse Text Classification • Updated Apr 22, 2024 • 4
Holarissun/RM-HH-AllMixNonPeft_harmless_gpt3_20000_gpt2-large_shuffleTrue_extractchosenTrue Text Classification • Updated Apr 22, 2024 • 10
Holarissun/RM-HH-AllMix_harmless_gpt3_20000_gemma2b_shuffleFalse_extractchosenTrue Updated Apr 22, 2024
Holarissun/RM-HH-AllMix_harmless_gpt3_20000_gemma2b_shuffleFalse_extractchosenFalse Updated Apr 22, 2024 • 1