---
license: llama2
datasets:
- evilfreelancer/toxicator-ru
language:
- ru
tags:
- toxify
- detoxify
- seq2seq
pipeline_tag: translation
---

# LLaMA 2 7B - Toxicator RU

This model is a fine-tune of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf). It was trained on the [evilfreelancer/toxicator-ru](https://huggingface.co/datasets/evilfreelancer/toxicator-ru) dataset, which was created from samples in the [s-nlp/russe_detox_2022](https://github.com/s-nlp/russe_detox_2022) project.

The model was tuned **just for lulz**, as an experiment with the [TorchTune](https://github.com/pytorch/torchtune) tool.

[100 examples](https://gist.github.com/EvilFreelancer/ac4215195b3c8b1e7dcd39ca51c47138) are available as a GitHub Gist.

## Links

* https://github.com/EvilFreelancer/toxicator-ru - GitHub repository with the training scripts and the dataset generation scripts
* https://huggingface.co/datasets/evilfreelancer/toxicator-ru - dataset
* https://api.wandb.ai/links/evilfreelancer/33t8pqze - wandb report on the training run
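
The dataset linked above can be inspected locally before any training. The sketch below assumes the Hugging Face `datasets` library is installed and that the dataset is publicly downloadable. The helper name `load_toxicator` is hypothetical and only used for illustration. The split and column names are not stated in this card, so the sketch loads every split and leaves their inspection to the reader.

```python
# Sketch only: requires `pip install datasets` and network access when called.
DATASET_ID = "evilfreelancer/toxicator-ru"  # dataset id taken from this card


def load_toxicator():
    """Download the dataset from the Hugging Face Hub and return all of its splits."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Without a `split` argument, load_dataset returns a DatasetDict of all splits.
    return load_dataset(DATASET_ID)


# Usage (hits the network):
#   dataset = load_toxicator()
#   print(dataset)  # lists the available splits, columns and row counts
```

Printing the returned object is usually enough to see which splits and columns exist before wiring the data into a fine-tuning recipe.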