MARTINI_enrich_BERTopic_thegoldenone

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

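from bertopic import BERTopic

# Download the model from the Hugging Face Hub and load it
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_thegoldenone")

# Summarize every topic (ID, document count, name)
topic_model.get_topic_info()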
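
Beyond inspecting the topic table, a loaded model can assign topics to new text. The following is a minimal sketch and not part of the original card: `assign_topics` and its `max_keywords` parameter are hypothetical names introduced for illustration. It assumes the saved model can resolve its embedding model on load; if it cannot, `BERTopic.load` accepts an `embedding_model` argument.

```python
def assign_topics(topic_model, docs, max_keywords=5):
    """Assign each document a topic ID and list that topic's top keywords.

    `topic_model` is expected to be a loaded BERTopic instance. This helper
    is illustrative (an assumption), not part of BERTopic itself.
    """
    # transform() returns (topic IDs, probabilities); only the IDs are used here.
    topics, _probs = topic_model.transform(docs)
    results = []
    for doc, topic_id in zip(docs, topics):
        topic_id = int(topic_id)
        # get_topic() yields (word, score) pairs, or a falsy value for an unknown ID.
        words = topic_model.get_topic(topic_id) or []
        keywords = [word for word, _score in words[:max_keywords]]
        results.append((doc, topic_id, keywords))
    return results


# Example usage (needs bertopic installed and network access to fetch the model):
# from bertopic import BERTopic
# topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_thegoldenone")
# for doc, topic_id, keywords in assign_topics(topic_model, ["New squat and deadlift routine"]):
#     print(topic_id, keywords)
```

A topic ID of -1 means BERTopic treated the document as an outlier rather than placing it in one of the named topics.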

Topic overview

  • Number of topics: 20 (including the outlier topic, -1)
  • Number of training documents: 2440

| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | civilisation - guys - nietzsche - sweden - white | 20 | -1_civilisation_guys_nietzsche_sweden |
| 0 | legiogloria - polos - garment - tracksuit - bottoms | 1332 | 0_legiogloria_polos_garment_tracksuit |
| 1 | neolithic - aryans - haplogroup - norse - europe | 109 | 1_neolithic_aryans_haplogroup_norse |
| 2 | instagram - deleted - parler - banned - censorship | 109 | 2_instagram_deleted_parler_banned |
| 3 | sweden - fredrik - migrants - riots - socialism | 91 | 3_sweden_fredrik_migrants_riots |
| 4 | mithras - deities - hermeticism - rosicrucians - angels | 86 | 4_mithras_deities_hermeticism_rosicrucians |
| 5 | youtube - views - bitchute - subscribers - deleted | 79 | 5_youtube_views_bitchute_subscribers |
| 6 | collagen - jotunheimnutrition - whey - supplement - creatine | 69 | 6_collagen_jotunheimnutrition_whey_supplement |
| 7 | benchpress - deadlifts - squats - kettlebell - stronger | 68 | 7_benchpress_deadlifts_squats_kettlebell |
| 8 | racist - leftists - nazis - fuck - accelerationism | 59 | 8_racist_leftists_nazis_fuck |
| 9 | podcast - ultimate - theodoric - demigod - civilisation | 55 | 9_podcast_ultimate_theodoric_demigod |
| 10 | aragorn - morgoth - eltharion - valiant - gawain | 55 | 10_aragorn_morgoth_eltharion_valiant |
| 11 | dauntless - goodreads - demigod - marcus - appreciated | 51 | 11_dauntless_goodreads_demigod_marcus |
| 12 | delays - shipped - postnord - delivering - dhl | 50 | 12_delays_shipped_postnord_delivering |
| 13 | ukraine - donetsk - russians - serbs - kremlin | 50 | 13_ukraine_donetsk_russians_serbs |
| 14 | physiognomy - masculine - strength - sociosexual - unattractive | 49 | 14_physiognomy_masculine_strength_sociosexual |
| 15 | kickboxing - thaiboxing - heavybag - grappling - opponents | 43 | 15_kickboxing_thaiboxing_heavybag_grappling |
| 16 | reading - machiavelli - homoerotic - chapter - diabolical | 24 | 16_reading_machiavelli_homoerotic_chapter |
| 17 | fizeekfriday - fitness - spiritually - gains - momoa | 21 | 17_fizeekfriday_fitness_spiritually_gains |
| 18 | coffees - kaffekompaniet - fairtrade - lindt - buttermaxxed | 20 | 18_coffees_kaffekompaniet_fairtrade_lindt |
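
The frequencies listed above can be checked against the stated number of training documents and turned into per-topic shares. The following is a small sketch using only the numbers from this card; `TOPIC_FREQUENCIES` and `topic_shares` are names introduced here for illustration.

```python
# Topic frequencies copied from the topic overview, keyed by topic ID.
TOPIC_FREQUENCIES = {
    -1: 20, 0: 1332, 1: 109, 2: 109, 3: 91, 4: 86, 5: 79, 6: 69, 7: 68,
    8: 59, 9: 55, 10: 55, 11: 51, 12: 50, 13: 50, 14: 49, 15: 43, 16: 24,
    17: 21, 18: 20,
}


def topic_shares(frequencies):
    """Return each topic's fraction of all documents."""
    total = sum(frequencies.values())
    return {topic_id: count / total for topic_id, count in frequencies.items()}


shares = topic_shares(TOPIC_FREQUENCIES)
total_docs = sum(TOPIC_FREQUENCIES.values())

print(f"documents covered: {total_docs}")            # 2440, matching the card
print(f"topic 0 share: {shares[0]:.1%}")             # 54.6%
print(f"outlier (-1) share: {shares[-1]:.1%}")       # 0.8%
```

The frequencies sum to the 2440 training documents, and topic 0 alone accounts for more than half of them, which is worth keeping in mind when comparing the remaining, much smaller topics.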

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
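
Each hyperparameter listed above corresponds to a keyword argument of the `BERTopic` constructor. The sketch below shows how they could be passed when configuring a fresh model. It is not the authors' training script: the embedding model and the UMAP and HDBSCAN settings are not recorded in this list, so `build_model` (a hypothetical helper) takes the embedding model as an argument. The `language: None` entry is likely a consequence of a custom embedding model having been supplied, though that is an assumption.

```python
# Hyperparameters as listed in this card; each key is a BERTopic constructor argument.
HYPERPARAMS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": False,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def constructor_kwargs(**overrides):
    """Merge the listed hyperparameters with caller-supplied overrides."""
    return {**HYPERPARAMS, **overrides}


def build_model(embedding_model, **overrides):
    """Create an unfitted BERTopic model (hypothetical helper, an assumption).

    `embedding_model` must be chosen by the caller because the card does
    not state which one was used for training.
    """
    # Imported lazily so constructor_kwargs() works without bertopic installed.
    from bertopic import BERTopic

    return BERTopic(embedding_model=embedding_model, **constructor_kwargs(**overrides))
```

For example, `build_model(my_embedder, min_topic_size=25)` would keep every listed setting except the minimum topic size, where `my_embedder` stands for whichever embedding model you choose.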

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
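
Loading a saved model can be sensitive to library versions, so it may help to compare the local environment with the versions listed above. The following sketch uses only the standard library; `EXPECTED_VERSIONS` and `version_mismatches` are names introduced here, and the keys are the PyPI distribution names (UMAP is published as `umap-learn`).

```python
from importlib import metadata

# Versions listed in this card, keyed by PyPI distribution name.
EXPECTED_VERSIONS = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.40",
    "umap-learn": "0.5.7",
    "pandas": "2.2.3",
    "scikit-learn": "1.5.2",
    "sentence-transformers": "3.3.1",
    "transformers": "4.46.3",
    "numba": "0.60.0",
    "plotly": "5.24.1",
}


def version_mismatches(expected=EXPECTED_VERSIONS):
    """Return {package: (expected, installed)} for packages that differ.

    `installed` is None when the package is not installed at all.
    """
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


for name, (wanted, installed) in version_mismatches().items():
    print(f"{name}: card lists {wanted}, found {installed or 'not installed'}")
```

A mismatch does not necessarily mean the model will fail to load; the check only flags where the environment differs from the one recorded here.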