Tollef J

tollefj

https://folk.ntnu.no/tollefj/

AI & ML interests

Coreference resolution, span prediction, summarization, topic modeling

Recent Activity

liked a model 4 days ago

google/gemma-3-27b-it

liked a model 4 days ago

google/gemma-3-12b-it

liked a model 4 days ago

google/gemma-3-4b-it

tollefj's activity

liked 3 models 4 days ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 4 days ago • 190k • 660

google/gemma-3-12b-it

Image-Text-to-Text • Updated 4 days ago • 65.6k • 199

google/gemma-3-4b-it

Image-Text-to-Text • Updated 4 days ago • 79.8k • 207

upvoted a collection 4 days ago

Gemma 3 Release

9 items • Updated 3 days ago • 252

commented on Introducing EuroBERT: A High-Performance Multilingual Encoder Model 6 days ago

Why are there so few languages involved in the training of these models? You argue that this data mix was selected "to create a corpus of European and most widely spoken languages, representing a broad range of alphabets and cultures."
But what is the relevance of other alphabets when, for example, you do not include any Nordic languages with large and high-quality datasets?

Prefixing it with "Euro" seems odd in this context. You have selected a tiny fraction of languages, so name it accordingly :-)
It would also make sense to refer to EuroEval https://euroeval.com/leaderboards/

commented on a paper 20 days ago

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 24 days ago • 93

updated a model about 1 month ago

tollefj/nordavind-llama-3.1-8b-v1

Text Generation • Updated Feb 10 • 6

published a model about 1 month ago

tollefj/nordavind-llama-3.1-8b-v1

Text Generation • Updated Feb 10 • 6

updated a Space about 1 month ago

Siktsok

Search trivia-reformatted data from SIKT's websites with OpenVINO

published a Space about 1 month ago

Siktsok

Search trivia-reformatted data from SIKT's websites with OpenVINO

New activity in answerdotai/ModernBERT-base about 2 months ago

Performance vs the original architecture on approximate original data sizes (BooksCorpus/Wikipedia)

#54 opened about 2 months ago by

liked a dataset 2 months ago

HPLT/HPLT2.0_cleaned

Viewer • Updated Jan 8 • 10.6B • 142k • 15

liked 2 models 3 months ago

ymcki/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • Updated Jan 18 • 823 • 12

Datou1111/shou_xin

Text-to-Image • Updated about 2 hours ago • 1.82k • 867

upvoted a collection 3 months ago

NB-Llama 3.x

NOTE: CURRENTLY THERE ARE CONVERSION ERRORS IN THESE MODELS. TEMPORARILY PUT OFFLINE. Llama 3.x models in various sizes. • 9 items • Updated Feb 6 • 2

liked a model 3 months ago

norallm/normistral-11b-warm

Text Generation • Updated 6 days ago • 765 • 6

liked a model 4 months ago

alibaba-damo/mgp-str-base

Image-to-Text • Updated Dec 11, 2023 • 7.79k • 64

New activity in sentence-transformers/all-MiniLM-L6-v2 4 months ago

ignore this

#90 opened 4 months ago by

updated a model 4 months ago

sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • Updated 10 days ago • 98.9M • 3.13k

liked a dataset 4 months ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 10.8k • 431