arxiv:2412.06676
Konstantin Dobler
konstantindobler
AI & ML interests
Natural Language Processing, Transfer Learning, Crosslingual Transfer
Recent Activity
Upvoted a paper 12 days ago
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
Organizations
Models (20)
konstantindobler/mistral7b-ar-tokenizer-swap-pure-bf16 • Text Generation • 9
konstantindobler/mistral7b-de-pure-bf16 • Text Generation • 8
konstantindobler/mistral7b-de-tokenizer-swap-mixed-bf16 • Text Generation • 18
konstantindobler/mistral7b-de-mixed-bf16 • Text Generation • 8
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16-v2-anneal-ablation • Text Generation • 10
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16-v2 • Text Generation • 12
konstantindobler/mistral7b-ar-tokenizer-swap-pure-bf16-anneal-ablation • Text Generation • 15
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16 • Text Generation • 13
konstantindobler/fasttext-ar-sentencepiece-bpe-32k
konstantindobler/fasttext-de-sentencepiece-bpe-32k
Datasets
None public yet