Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
IrwinD
/
log_sage_reward_model
like
0
Text Classification
Transformers
Safetensors
hdfs_rlhf_log_summary_dataset
distilbert
trl
reward-trainer
Generated from Trainer
Eval Results
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
6e34384
log_sage_reward_model
1 contributor
History:
7 commits
IrwinD
End of training
6e34384
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
4.29 kB
End of training
10 months ago
config.json
Safe
655 Bytes
End of training
10 months ago
model.safetensors
Safe
268 MB
LFS
End of training
10 months ago
special_tokens_map.json
Safe
125 Bytes
Model save
10 months ago
tokenizer.json
Safe
711 kB
Model save
10 months ago
tokenizer_config.json
Safe
1.25 kB
End of training
10 months ago
training_args.bin
Safe
4.98 kB
LFS
End of training
10 months ago
vocab.txt
Safe
232 kB
Model save
10 months ago