Model Card for text-complexity-classification

The model classifies German-language texts into four language complexity classes:

  • easy language / leichte Sprache
  • simple language / einfache Sprache
  • everyday language / Alltagssprache
  • special language / Fachsprache

The model was trained on a corpus of over 300,000 texts from the language categories listed above. The texts were collected from freely available websites, and the sources were selected for thematic diversity.

Evaluation

  • F1: 0.982
  • Precision: 0.981
  • Recall: 0.983
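
As a quick consistency check on the figures above, F1 can be recomputed as the harmonic mean of precision and recall. This is only a sketch: the model card does not state how the scores were averaged across the four classes, so the assumption here is that all three numbers refer to the same aggregate. The function name `f1_from` is illustrative and not part of any library.

```python
def f1_from(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        # Avoid division by zero when both inputs are zero.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the Evaluation section.
reported_precision = 0.981
reported_recall = 0.983

f1 = f1_from(reported_precision, reported_recall)
print(round(f1, 3))  # 0.982
```

The recomputed value rounds to the reported F1 of 0.982, so the three figures are mutually consistent under that assumption.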

How to use

>>> from transformers import pipeline
>>> classifier = pipeline(model="krupper/text-complexity-classification")
>>> classifier("Bei Kleinkindern unter 2 Jahren liegen nur begrenzte Erfahrungen zur Pharmakokinetik vor.")
[{'label': 'special_language', 'score': 0.999923825263977}]
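
To classify several texts at once and inspect the scores for every class, something like the following sketch may help. It assumes a recent `transformers` release in which the text classification pipeline accepts `top_k=None` to return all class scores, and it names the task `"text-classification"` explicitly, which the snippet above leaves to be inferred from the model. The helper names `best_prediction` and `classify_batch` are illustrative only.

```python
from typing import Dict, List

MODEL_ID = "krupper/text-complexity-classification"


def best_prediction(scores: List[Dict]) -> Dict:
    """Pick the entry with the highest score from one text's class scores."""
    return max(scores, key=lambda item: item["score"])


def classify_batch(texts: List[str]) -> List[Dict]:
    """Classify a list of texts and return the top prediction for each one."""
    # Imported here so the helper above can be used without transformers installed.
    from transformers import pipeline

    # Assumption: the task name matches what the pipeline infers for this model.
    classifier = pipeline("text-classification", model=MODEL_ID)

    # Assumption: with top_k=None and a list input, the pipeline returns
    # one list of {'label', 'score'} dicts per input text.
    all_scores = classifier(texts, top_k=None)
    return [best_prediction(scores) for scores in all_scores]
```

Calling `classify_batch` with a list of German sentences should return one `{'label': ..., 'score': ...}` dict per sentence. Creating the pipeline once and passing all texts together avoids reloading the model for every call.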

DOI: https://doi.org/10.57967/hf/0131

Model size: 110M params (Safetensors; tensor types I64 and F32)