File size: 678 Bytes
33265fb
8959dc5
56f4e90
8928304
8959dc5
33265fb
56f4e90
8cae0fb
56f4e90
8cae0fb
 
 
56f4e90
8cae0fb
56f4e90
583c00d
56f4e90
f1b36b5
 
 
 
 
 
2e735aa
f1b36b5
351ae34
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
license: cc-by-nc-3.0
datasets:
- FredZhang7/toxi-text-3M
pipeline_tag: text-classification
---

**I have decided to release the auto-moderation models all at once sometime in July. The curated datasets for training these models will be avaliable first.**

<br>

Finished training: 6/30/2023

Final Train & Validation Accuracy: 95-98%

Large model (v2) will be avaliable for PyTorch

Lightweight model and tokenizer (v1) will be avaliable for transformers.js

<br>

<br>

Models tested: roberta, xlm-roberta, bert-tiny, bert-base-cased/uncased, bert-multilingual-cased/uncased, alberta-large-v2

Model chosen based on cost-efficiency and performance: bert-multilingual-cased