ShenaoZ/0.001_ablation_4iters_bs256_only2third_iter_1
Text Generation
Transformers
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License: mit
Commit History
1c18841  Model save (verified), ShenaoZ committed on Apr 26
fea1bce  End of training (verified), ShenaoZ committed on Apr 26
ddcf16e  Model save (verified), ShenaoZ committed on Apr 26
6bd085b  initial commit (verified), ShenaoZ committed on Apr 26