Stefan Schweter's picture

Stefan Schweter PRO

stefan-it

·

AI & ML interests

Flair Library 💕, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models

Recent Activity

upvoted a paper about 15 hours ago

Boundless Byte Pair Encoding: Breaking the Pre-tokenization Barrier

upvoted a paper about 15 hours ago

Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents

upvoted a paper about 15 hours ago

Overcoming Vocabulary Constraints with Pixel-level Fallback

View all activity

Organizations

stefan-it's activity

liked a dataset 3 days ago

huggingface-legal/takedown-notices

Viewer • Updated about 11 hours ago • 25 • 928 • 22

liked a dataset 9 days ago

BramVanroy/finewebs-copyright-domains

Viewer • Updated 9 days ago • 361 • 35 • 1

liked 2 models 10 days ago

stanfordnlp/mrt5-large

Text2Text Generation • Updated 8 days ago • 69 • 2

stanfordnlp/mrt5-small

Text2Text Generation • Updated 8 days ago • 86 • 2

liked a Space 15 days ago

FAT5 (Flash Attention T5) report

English version of the blog post introducing FAT5 model

liked a Space 16 days ago

Follow History

Track history of Follows of organizations and users on HF

liked a dataset 18 days ago

bbunzeck/babylm-german

Viewer • Updated 17 days ago • 1.88M • 55 • 2

liked a dataset 24 days ago

Open-Orca/FLAN

Viewer • Updated Aug 2, 2023 • 378M • 13.4k • 176

liked a model about 1 month ago

chandar-lab/NeoBERT

Feature Extraction • Updated 10 days ago • 7.16k • 102

liked a Space about 1 month ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset about 1 month ago

google/smol

Viewer • Updated Mar 3 • 811k • 3.8k • 50

liked a dataset about 2 months ago

BUCOLIN/HisTR

Viewer • Updated Feb 3 • 25.3k • 96 • 3

liked a model about 2 months ago

Rijgersberg/GEITje-7B

Text Generation • Updated Jan 26 • 1.25k • 19

liked a dataset about 2 months ago

MultiCoNER/multiconer_v2

Viewer • Updated Jul 6, 2023 • 2.71M • 1.52k • 14

liked a dataset 2 months ago

oberbics/Multilingual_Topic-Specific_Article-Extraction_and_Classification

Viewer • Updated Jan 31 • 874 • 162 • 1

liked a model 2 months ago

amunozo/pixel-base-german

Updated Dec 12, 2024 • 16 • 2

liked a dataset 3 months ago

TurkuNLP/finerweb-10bt

Viewer • Updated Jan 17 • 7.1M • 248 • 6

liked a Space 3 months ago

Whisper JAX

liked a dataset 3 months ago

batubayk/TR-News

Viewer • Updated Mar 4, 2023 • 308k • 173 • 11

liked a dataset 4 months ago

dvilasuero/french-news-classification

Viewer • Updated Dec 16, 2024 • 10 • 61 • 2