File size: 674 Bytes
33265fb 8959dc5 56f4e90 8928304 8959dc5 33265fb 56f4e90 bee5fba 56f4e90 8cae0fb 56f4e90 8cae0fb 56f4e90 583c00d 56f4e90 f1b36b5 2e735aa f1b36b5 351ae34 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
license: cc-by-nc-3.0
datasets:
- FredZhang7/toxi-text-3M
pipeline_tag: text-classification
---
**I have decided to release all auto-moderation models at once sometime in July. The curated datasets for training these models will be avaliable first.**
<br>
Finished training: 6/30/2023
Final Train & Validation Accuracy: 95-98%
Large model (v2) will be avaliable for PyTorch
Lightweight model and tokenizer (v1) will be avaliable for transformers.js
<br>
<br>
Models tested: roberta, xlm-roberta, bert-tiny, bert-base-cased/uncased, bert-multilingual-cased/uncased, alberta-large-v2
Model chosen based on cost-efficiency and performance: bert-multilingual-cased |