xiryss/llm-course-hw2-reward-model
Tags: Text Classification, Transformers, Safetensors, HumanLLMs/Human-Like-DPO-Dataset, llama, Generated from Trainer, trl, reward-trainer, text-generation-inference, Inference Endpoints
Commit History (branch: main)
1bb0fa0 Update README.md (verified, xiryss committed 14 days ago)
334c31b Update README.md (verified, xiryss committed 14 days ago)
8d05d1f Update README.md (verified, xiryss committed 14 days ago)
2f90094 Update README.md (verified, xiryss committed 14 days ago)
3906274 Update README.md (verified, xiryss committed 14 days ago)
721e276 Update README.md (verified, xiryss committed 14 days ago)
26f11a0 Update README.md (verified, xiryss committed 14 days ago)
124b5b9 xiryss/llm-course-hw2-reward-model (verified, xiryss committed 14 days ago)
61b2e7a xiryss/llm-course-hw2-reward-model (verified, xiryss committed 15 days ago)
3634524 xiryss/llm-course-hw2-reward-model (verified, xiryss committed 15 days ago)
7e6b0e5 xiryss/llm-course-hw2-reward-model (verified, xiryss committed 16 days ago)
2c33e2a xiryss/llm-course-hw2-reward-model (verified, xiryss committed 16 days ago)
13fef5f initial commit (verified, xiryss committed 16 days ago)