MARTINI_enrich_BERTopic_thegoldenone

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_thegoldenone")

topic_model.get_topic_info()
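
Once loaded, the model can also assign topics to text it has not seen. The following is a minimal sketch rather than part of the original card: `assign_topics` and `count_assignments` are hypothetical helper names, and the sketch assumes the embedding model was stored with the checkpoint (if it was not, `BERTopic.load` would need an `embedding_model` argument).

```python
from collections import Counter

MODEL_ID = "AIDA-UPM/MARTINI_enrich_BERTopic_thegoldenone"


def assign_topics(docs, model_id=MODEL_ID):
    """Load the model and predict one topic ID per document (hypothetical helper)."""
    # Imported lazily so the counting helper below works without bertopic installed.
    from bertopic import BERTopic

    topic_model = BERTopic.load(model_id)
    # transform() returns the predicted topic IDs and the probabilities.
    topics, probabilities = topic_model.transform(docs)
    return list(topics), probabilities


def count_assignments(topics):
    """Split predictions into outliers (topic -1) and documents with a real topic."""
    counts = Counter(int(topic) for topic in topics)
    outliers = counts.get(-1, 0)
    assigned = sum(n for topic, n in counts.items() if topic != -1)
    return {"outliers": outliers, "assigned": assigned}
```

Calling `assign_topics(["some new message"])` downloads the model, so it needs network access. `count_assignments` works on any list of topic IDs.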

Topic overview

  • Number of topics: 20 (including the outlier topic, ID -1)
  • Number of training documents: 2440
Topic ID | Topic Keywords | Topic Frequency | Label
-1 | civilisation - guys - nietzsche - sweden - white | 20 | -1_civilisation_guys_nietzsche_sweden
0 | legiogloria - polos - garment - tracksuit - bottoms | 1332 | 0_legiogloria_polos_garment_tracksuit
1 | neolithic - aryans - haplogroup - norse - europe | 109 | 1_neolithic_aryans_haplogroup_norse
2 | instagram - deleted - parler - banned - censorship | 109 | 2_instagram_deleted_parler_banned
3 | sweden - fredrik - migrants - riots - socialism | 91 | 3_sweden_fredrik_migrants_riots
4 | mithras - deities - hermeticism - rosicrucians - angels | 86 | 4_mithras_deities_hermeticism_rosicrucians
5 | youtube - views - bitchute - subscribers - deleted | 79 | 5_youtube_views_bitchute_subscribers
6 | collagen - jotunheimnutrition - whey - supplement - creatine | 69 | 6_collagen_jotunheimnutrition_whey_supplement
7 | benchpress - deadlifts - squats - kettlebell - stronger | 68 | 7_benchpress_deadlifts_squats_kettlebell
8 | racist - leftists - nazis - fuck - accelerationism | 59 | 8_racist_leftists_nazis_fuck
9 | podcast - ultimate - theodoric - demigod - civilisation | 55 | 9_podcast_ultimate_theodoric_demigod
10 | aragorn - morgoth - eltharion - valiant - gawain | 55 | 10_aragorn_morgoth_eltharion_valiant
11 | dauntless - goodreads - demigod - marcus - appreciated | 51 | 11_dauntless_goodreads_demigod_marcus
12 | delays - shipped - postnord - delivering - dhl | 50 | 12_delays_shipped_postnord_delivering
13 | ukraine - donetsk - russians - serbs - kremlin | 50 | 13_ukraine_donetsk_russians_serbs
14 | physiognomy - masculine - strength - sociosexual - unattractive | 49 | 14_physiognomy_masculine_strength_sociosexual
15 | kickboxing - thaiboxing - heavybag - grappling - opponents | 43 | 15_kickboxing_thaiboxing_heavybag_grappling
16 | reading - machiavelli - homoerotic - chapter - diabolical | 24 | 16_reading_machiavelli_homoerotic_chapter
17 | fizeekfriday - fitness - spiritually - gains - momoa | 21 | 17_fizeekfriday_fitness_spiritually_gains
18 | coffees - kaffekompaniet - fairtrade - lindt - buttermaxxed | 20 | 18_coffees_kaffekompaniet_fairtrade_lindt
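
The frequencies in the overview above can be checked against the stated number of training documents. The sketch below copies the counts from the table; `TOPIC_FREQUENCIES` and `topic_shares` are illustrative names, not part of BERTopic. The counts sum to 2440, and topic 0 alone accounts for more than half of the documents.

```python
# Topic frequencies copied from the overview table (topic ID -> document count).
TOPIC_FREQUENCIES = {
    -1: 20, 0: 1332, 1: 109, 2: 109, 3: 91, 4: 86, 5: 79, 6: 69, 7: 68,
    8: 59, 9: 55, 10: 55, 11: 51, 12: 50, 13: 50, 14: 49, 15: 43,
    16: 24, 17: 21, 18: 20,
}


def topic_shares(frequencies):
    """Return each topic's fraction of all documents."""
    total = sum(frequencies.values())
    return {topic: count / total for topic, count in frequencies.items()}


shares = topic_shares(TOPIC_FREQUENCIES)
print(f"total documents: {sum(TOPIC_FREQUENCIES.values())}")
print(f"topic 0 share:   {shares[0]:.1%}")
print(f"outlier share:   {shares[-1]:.1%}")
```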

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
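
The values listed above map directly onto keyword arguments of the `BERTopic` constructor. The following sketch shows how they could be passed. `HYPERPARAMETERS` and `build_untrained_model` are hypothetical names. The card does not state the embedding, UMAP, or HDBSCAN settings, so this is not expected to reproduce the published topics. In BERTopic, `language` is typically `None` when a custom embedding model was supplied, and the card does not name that model.

```python
# Constructor arguments as listed under "Training hyperparameters".
HYPERPARAMETERS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": False,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_untrained_model(**overrides):
    """Create an unfitted BERTopic instance with the listed settings (hypothetical helper).

    Extra keyword arguments, such as embedding_model, override or extend the defaults.
    """
    # Imported lazily so the settings can be inspected without bertopic installed.
    from bertopic import BERTopic

    params = {**HYPERPARAMETERS, **overrides}
    return BERTopic(**params)
```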

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
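
Loading a saved topic model can be sensitive to library versions, so it may help to compare the local environment with the versions listed above. The sketch below uses only the standard library. The PyPI distribution names (for example `umap-learn` for UMAP) and the helper name `find_version_mismatches` are assumptions of this example, not part of the card.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions from "Framework versions", keyed by assumed PyPI distribution name.
PINNED_VERSIONS = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.40",
    "umap-learn": "0.5.7",
    "pandas": "2.2.3",
    "scikit-learn": "1.5.2",
    "sentence-transformers": "3.3.1",
    "transformers": "4.46.3",
    "numba": "0.60.0",
    "plotly": "5.24.1",
}


def find_version_mismatches(pinned):
    """Return {package: (expected, installed)} for packages that differ or are missing."""
    mismatches = {}
    for package, expected in pinned.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != expected:
            mismatches[package] = (expected, installed)
    return mismatches


for package, (expected, installed) in find_version_mismatches(PINNED_VERSIONS).items():
    print(f"{package}: expected {expected}, found {installed}")
```

A mismatch does not necessarily mean the model will fail to load. The output is only a starting point when debugging.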