MARTINI_enrich_BERTopic_thegoldenone
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_thegoldenone")
topic_model.get_topic_info()
Topic overview
- Number of topics: 20
- Number of training documents: 2440
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | civilisation - guys - nietzsche - sweden - white | 20 | -1_civilisation_guys_nietzsche_sweden |
0 | legiogloria - polos - garment - tracksuit - bottoms | 1332 | 0_legiogloria_polos_garment_tracksuit |
1 | neolithic - aryans - haplogroup - norse - europe | 109 | 1_neolithic_aryans_haplogroup_norse |
2 | instagram - deleted - parler - banned - censorship | 109 | 2_instagram_deleted_parler_banned |
3 | sweden - fredrik - migrants - riots - socialism | 91 | 3_sweden_fredrik_migrants_riots |
4 | mithras - deities - hermeticism - rosicrucians - angels | 86 | 4_mithras_deities_hermeticism_rosicrucians |
5 | youtube - views - bitchute - subscribers - deleted | 79 | 5_youtube_views_bitchute_subscribers |
6 | collagen - jotunheimnutrition - whey - supplement - creatine | 69 | 6_collagen_jotunheimnutrition_whey_supplement |
7 | benchpress - deadlifts - squats - kettlebell - stronger | 68 | 7_benchpress_deadlifts_squats_kettlebell |
8 | racist - leftists - nazis - fuck - accelerationism | 59 | 8_racist_leftists_nazis_fuck |
9 | podcast - ultimate - theodoric - demigod - civilisation | 55 | 9_podcast_ultimate_theodoric_demigod |
10 | aragorn - morgoth - eltharion - valiant - gawain | 55 | 10_aragorn_morgoth_eltharion_valiant |
11 | dauntless - goodreads - demigod - marcus - appreciated | 51 | 11_dauntless_goodreads_demigod_marcus |
12 | delays - shipped - postnord - delivering - dhl | 50 | 12_delays_shipped_postnord_delivering |
13 | ukraine - donetsk - russians - serbs - kremlin | 50 | 13_ukraine_donetsk_russians_serbs |
14 | physiognomy - masculine - strength - sociosexual - unattractive | 49 | 14_physiognomy_masculine_strength_sociosexual |
15 | kickboxing - thaiboxing - heavybag - grappling - opponents | 43 | 15_kickboxing_thaiboxing_heavybag_grappling |
16 | reading - machiavelli - homoerotic - chapter - diabolical | 24 | 16_reading_machiavelli_homoerotic_chapter |
17 | fizeekfriday - fitness - spiritually - gains - momoa | 21 | 17_fizeekfriday_fitness_spiritually_gains |
18 | coffees - kaffekompaniet - fairtrade - lindt - buttermaxxed | 20 | 18_coffees_kaffekompaniet_fairtrade_lindt |
Training hyperparameters
- calculate_probabilities: True
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
- zeroshot_min_similarity: 0.7
- zeroshot_topic_list: None
Framework versions
- Numpy: 1.26.4
- HDBSCAN: 0.8.40
- UMAP: 0.5.7
- Pandas: 2.2.3
- Scikit-Learn: 1.5.2
- Sentence-transformers: 3.3.1
- Transformers: 4.46.3
- Numba: 0.60.0
- Plotly: 5.24.1
- Python: 3.10.12
- Downloads last month
- 4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.