Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
universe99
/
dpo_ilk_training
like
0
Text Generation
Transformers
Safetensors
English
gemma
text-generation-inference
unsloth
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo_ilk_training
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
universe99
Trained with Unsloth
c44a0fa
verified
11 months ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
11 months ago
README.md
Safe
559 Bytes
Trained with Unsloth
11 months ago
config.json
Safe
720 Bytes
Trained with Unsloth
11 months ago
generation_config.json
Safe
132 Bytes
Trained with Unsloth
11 months ago
model-00001-of-00002.safetensors
Safe
4.95 GB
LFS
Trained with Unsloth
11 months ago
model-00002-of-00002.safetensors
Safe
67.1 MB
LFS
Trained with Unsloth
11 months ago
model.safetensors.index.json
Safe
13.5 kB
Trained with Unsloth
11 months ago
special_tokens_map.json
Safe
636 Bytes
Upload tokenizer
11 months ago
tokenizer.json
Safe
17.5 MB
LFS
Upload tokenizer
11 months ago
tokenizer.model
Safe
4.24 MB
LFS
Upload tokenizer
11 months ago
tokenizer_config.json
Safe
40 kB
Upload tokenizer
11 months ago