Turmbücher Language Model

Language model (embedding model) for Early Modern German (focusing on Swiss texts of the 16th century).

Part of the developments at the Digital Humanities@University of Bern. Developed by Ismail Prada Ziegler based on different texts (see below).

This repository contains the language models (forward & backward) that were used to train the Turmbücher NER.

Two models for premodern German trained by Ismail Prada Ziegler as part of a research project at the University of Bern, Digital Humanities.

We recommend using flairs stacked embeddings for the best effect.

Data Set

Main data set: Berner Turmbücher, early volumes from 16th C., Early New High German, 61k tokens training data.

Secondary data sets: