MARTINI_enrich_BERTopic_SunflowerSociety

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_SunflowerSociety")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 41
  • Number of training documents: 5874
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 covid - america - whites - christian - never 20 -1_covid_america_whites_christian
0 retweeted - musk - followers - libtards - shadowbanned 3723 0_retweeted_musk_followers_libtards
1 openai - links - llm - 2023 - robots 138 1_openai_links_llm_2023
2 japan - shogunate - yamaguchi - kishida - ultranationalist 120 2_japan_shogunate_yamaguchi_kishida
3 feminism - manosphere - mothers - marry - nana 119 3_feminism_manosphere_mothers_marry
4 souls - poet - yonder - ruskin - stillness 109 4_souls_poet_yonder_ruskin
5 photorealism - cyberpunk - midjourney - warzone - neon 102 5_photorealism_cyberpunk_midjourney_warzone
6 jews - antisemitic - netanyahu - israelis - persecuted 96 6_jews_antisemitic_netanyahu_israelis
7 ai - propaganda - humanity - imagination - centralized 90 7_ai_propaganda_humanity_imagination
8 goddamn - apawgus - outsmarted - standups - shhh 80 8_goddamn_apawgus_outsmarted_standups
9 testosterone - estrogenic - vitamin - sunscreens - metabolic 79 9_testosterone_estrogenic_vitamin_sunscreens
10 mcluhan - modernity - decentralization - metaverse - trollgaze 78 10_mcluhan_modernity_decentralization_metaverse
11 sunflowersocietymag - submissions - january - vanguardist - dunsany 76 11_sunflowersocietymag_submissions_january_vanguardist
12 calvinists - baptists - puritan - kjv - catechism 71 12_calvinists_baptists_puritan_kjv
13 dimeschild - dissident - mongols - latest - societies 67 13_dimeschild_dissident_mongols_latest
14 trudeau - canadians - alberta - multiculturalism - condemns 63 14_trudeau_canadians_alberta_multiculturalism
15 immigration - canadians - trudeau - newcomers - minister 57 15_immigration_canadians_trudeau_newcomers
16 vaccinated - paxlovid - bivalent - boosters - reinfection 56 16_vaccinated_paxlovid_bivalent_boosters
17 fomc - dedollarization - usinflationcalculator - hyperinflated - banknotes 51 17_fomc_dedollarization_usinflationcalculator_hyperinflated
18 leftists - fascism - liberalism - brainwashed - transgenderism 45 18_leftists_fascism_liberalism_brainwashed
19 transphobe - transgenders - dysphoria - lgbtqp - lobotomies 43 19_transphobe_transgenders_dysphoria_lgbtqp
20 democrats - ballots - reaganism - gop - midterms 43 20_democrats_ballots_reaganism_gop
21 zelensky - russia - war - prigozhin - tanks 41 21_zelensky_russia_war_prigozhin
22 immigrants - majority - assimilation - demographic - multiracial 40 22_immigrants_majority_assimilation_demographic
23 inbreeding - miscegenation - autosomal - phenotype - haplogroup 39 23_inbreeding_miscegenation_autosomal_phenotype
24 bidenomics - joe - kamala - impeachment - announced 36 24_bidenomics_joe_kamala_impeachment
25 shortages - electric - gasbuddy - lithium - euros 32 25_shortages_electric_gasbuddy_lithium
26 illegals - tanaiste - westmeath - lampedusa - rwanda 31 26_illegals_tanaiste_westmeath_lampedusa
27 monkeypox - gonorrhoea - aids - prevalence - homosexualists 29 27_monkeypox_gonorrhoea_aids_prevalence
28 birthrate - abortions - russia - decreasing - ourworldindata 28 28_birthrate_abortions_russia_decreasing
29 illegals - texas - reynosa - border - mcallen 28 29_illegals_texas_reynosa_border
30 trump - fbi - pelosi - arrested - prosecution 28 30_trump_fbi_pelosi_arrested
31 livestream - 10pm - millenniyule - episodes - countdown 27 31_livestream_10pm_millenniyule_episodes
32 vaccinated - fauci - misinformation - australia - statins 26 32_vaccinated_fauci_misinformation_australia
33 psalm - micah - canaan - keepeth - hypocrites 26 33_psalm_micah_canaan_keepeth
34 millennials - zoomer - posterity - pessimists - mcmansion 26 34_millennials_zoomer_posterity_pessimists
35 prosecuted - rape - pakistanis - englishman - hatred 24 35_prosecuted_rape_pakistanis_englishman
36 insects - parasites - cannibalize - locust - eat 22 36_insects_parasites_cannibalize_locust
37 france - riots - hollande - arrested - migrants 22 37_france_riots_hollande_arrested
38 thanks - subscribers - youtube - shitpost - 2600登録者をありかとうこさいます 22 38_thanks_subscribers_youtube_shitpost
39 meteorologists - arkstorm - hoax - ozone - evacuate 21 39_meteorologists_arkstorm_hoax_ozone

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
2
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.