File size: 289 Bytes
ac011a5 |
1 2 3 4 5 6 7 8 9 10 11 |
---
license: apache-2.0
datasets:
- johannhartmann/steroids
- johannhartmann/oh25_mistral_dpo_de
language:
- de
- en
---
This is a simple experiment using geman ORPO training for one epoch using qlora and unsloth on [Vezora/Mistral-22B-v0.2](https://huggingface.coVezora/Mistral-22B-v0.2) |