jkazdan/anthropic_dpo_gemma_2_2b_helpsteer2
Tags: Safetensors, gemma2, trl, dpo, Generated from Trainer
License: gemma
anthropic_dpo_gemma_2_2b_helpsteer2/training_args.bin @ 54776be
Commit History
54776be (verified): jkazdan/synthetic-dpo-gemma-2-2b-helpsteer2
committed by jkazdan on Oct 13, 2024
79a9763 (verified): jkazdan/synthetic-dpo-gemma-2-2b-helpsteer2
committed by jkazdan on Oct 12, 2024