Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Gredora
/
Llama-2-7b-ORM-LoRA
like
0
Transformers
TensorBoard
Safetensors
RLHFlow/Deepseek-ORM-Data-Pairwise
Generated from Trainer
trl
reward-trainer
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Llama-2-7b-ORM-LoRA
/
runs
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
Gredora
Model save
df5e903
verified
25 days ago
Mar13_23-31-11_odin2
Training in progress, step 500
28 days ago
Mar13_23-36-35_odin2
Model save
25 days ago