Turmbücher Language Model
Language model (embedding model) for Early Modern German, with a focus on Swiss texts of the 16th century.
Developed by Ismail Prada Ziegler as part of a research project at Digital Humanities, University of Bern, and trained on the texts listed below.
This repository contains two language models (one forward, one backward) that were used to train the Turmbücher NER.
We recommend using Flair's stacked embeddings (StackedEmbeddings) for best results.
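A minimal sketch of how the forward and backward models could be combined with Flair's `StackedEmbeddings`. The function names `build_stacked_embeddings` and `embed_text` are illustrative, and the two file paths are placeholders: this card does not state the model file names, so pass the paths to the files you actually downloaded.

```python
from pathlib import Path


def build_stacked_embeddings(forward_path, backward_path):
    """Combine a forward and a backward Flair language model into one embedding.

    Both arguments are paths to locally downloaded model files; the actual
    file names are not given in this card, so they are left to the caller.
    """
    paths = [Path(forward_path), Path(backward_path)]
    for path in paths:
        if not path.is_file():
            raise FileNotFoundError(f"Language model file not found: {path}")

    # Imported here so the path check above runs even without flair installed.
    from flair.embeddings import FlairEmbeddings, StackedEmbeddings

    # Each FlairEmbeddings instance wraps one character-level language model;
    # StackedEmbeddings concatenates their per-token vectors.
    return StackedEmbeddings([FlairEmbeddings(str(path)) for path in paths])


def embed_text(stacked, text):
    """Embed one piece of text and return (token, vector) pairs."""
    from flair.data import Sentence

    sentence = Sentence(text)
    stacked.embed(sentence)
    return [(token.text, token.embedding) for token in sentence]
```

The stacked embedding returned by `build_stacked_embeddings` can also be passed as the `embeddings` argument of a Flair `SequenceTagger` when training an NER model on similar texts.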
Data Set
Main data set: Berner Turmbücher, early volumes from the 16th century, Early New High German, 61k tokens of training data.
Secondary data sets:
- SSRQ - Fribourg, 59k tokens.
- Chorgerichtsmanuale (unpublished), 76k tokens.
- Königsfelden Charters, 623k tokens.
- Talgerichtsprotokolle (unpublished), 438k tokens.