---
license: apache-2.0
datasets:
- johannhartmann/steroids
- johannhartmann/oh25_mistral_dpo_de
language:
- de
- en
---

This is a simple experiment in German ORPO training: one epoch with QLoRA and Unsloth on [Vezora/Mistral-22B-v0.2](https://huggingface.co/Vezora/Mistral-22B-v0.2).
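
For readers unfamiliar with ORPO, the objective it optimizes can be sketched in a few lines of plain Python. This is not the training code used for this model. It is a minimal illustration of the per-example ORPO loss as usually defined (supervised loss on the chosen answer plus a weighted odds-ratio penalty), assuming the inputs are length-normalized log-probabilities. The function names and the weight `lam=0.1` are hypothetical choices for illustration.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp), with avg_logp < 0."""
    # log1p(-exp(x)) evaluates log(1 - p) without forming 1 - p directly.
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_logp: float, rejected_logp: float, lam: float = 0.1) -> float:
    """Per-example ORPO loss: SFT term + lam * odds-ratio term.

    chosen_logp / rejected_logp are average token log-probabilities of the
    preferred and dispreferred answers (assumed inputs, both negative).
    """
    # Log of the odds ratio between the chosen and the rejected answer.
    log_odds_ratio = log_odds(chosen_logp) - log_odds(rejected_logp)
    # -log(sigmoid(x)) rewritten as log(1 + exp(-x)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    # Supervised fine-tuning term: negative log-likelihood of the chosen answer.
    sft_term = -chosen_logp
    return sft_term + lam * odds_ratio_term


if __name__ == "__main__":
    # The loss is lower when the model already prefers the chosen answer.
    print(orpo_loss(-0.5, -2.0))
    print(orpo_loss(-2.0, -0.5))
```

In an actual run the log-probabilities come from the model being fine-tuned, and the loss is averaged over batches of preference pairs such as those in the datasets listed in the metadata above.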