MARTINI_enrich_BERTopic_SunflowerSociety

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_SunflowerSociety")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 41
  • Number of training documents: 5874
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 covid - america - whites - christian - never 20 -1_covid_america_whites_christian
0 retweeted - musk - followers - libtards - shadowbanned 3723 0_retweeted_musk_followers_libtards
1 openai - links - llm - 2023 - robots 138 1_openai_links_llm_2023
2 japan - shogunate - yamaguchi - kishida - ultranationalist 120 2_japan_shogunate_yamaguchi_kishida
3 feminism - manosphere - mothers - marry - nana 119 3_feminism_manosphere_mothers_marry
4 souls - poet - yonder - ruskin - stillness 109 4_souls_poet_yonder_ruskin
5 photorealism - cyberpunk - midjourney - warzone - neon 102 5_photorealism_cyberpunk_midjourney_warzone
6 jews - antisemitic - netanyahu - israelis - persecuted 96 6_jews_antisemitic_netanyahu_israelis
7 ai - propaganda - humanity - imagination - centralized 90 7_ai_propaganda_humanity_imagination
8 goddamn - apawgus - outsmarted - standups - shhh 80 8_goddamn_apawgus_outsmarted_standups
9 testosterone - estrogenic - vitamin - sunscreens - metabolic 79 9_testosterone_estrogenic_vitamin_sunscreens
10 mcluhan - modernity - decentralization - metaverse - trollgaze 78 10_mcluhan_modernity_decentralization_metaverse
11 sunflowersocietymag - submissions - january - vanguardist - dunsany 76 11_sunflowersocietymag_submissions_january_vanguardist
12 calvinists - baptists - puritan - kjv - catechism 71 12_calvinists_baptists_puritan_kjv
13 dimeschild - dissident - mongols - latest - societies 67 13_dimeschild_dissident_mongols_latest
14 trudeau - canadians - alberta - multiculturalism - condemns 63 14_trudeau_canadians_alberta_multiculturalism
15 immigration - canadians - trudeau - newcomers - minister 57 15_immigration_canadians_trudeau_newcomers
16 vaccinated - paxlovid - bivalent - boosters - reinfection 56 16_vaccinated_paxlovid_bivalent_boosters
17 fomc - dedollarization - usinflationcalculator - hyperinflated - banknotes 51 17_fomc_dedollarization_usinflationcalculator_hyperinflated
18 leftists - fascism - liberalism - brainwashed - transgenderism 45 18_leftists_fascism_liberalism_brainwashed
19 transphobe - transgenders - dysphoria - lgbtqp - lobotomies 43 19_transphobe_transgenders_dysphoria_lgbtqp
20 democrats - ballots - reaganism - gop - midterms 43 20_democrats_ballots_reaganism_gop
21 zelensky - russia - war - prigozhin - tanks 41 21_zelensky_russia_war_prigozhin
22 immigrants - majority - assimilation - demographic - multiracial 40 22_immigrants_majority_assimilation_demographic
23 inbreeding - miscegenation - autosomal - phenotype - haplogroup 39 23_inbreeding_miscegenation_autosomal_phenotype
24 bidenomics - joe - kamala - impeachment - announced 36 24_bidenomics_joe_kamala_impeachment
25 shortages - electric - gasbuddy - lithium - euros 32 25_shortages_electric_gasbuddy_lithium
26 illegals - tanaiste - westmeath - lampedusa - rwanda 31 26_illegals_tanaiste_westmeath_lampedusa
27 monkeypox - gonorrhoea - aids - prevalence - homosexualists 29 27_monkeypox_gonorrhoea_aids_prevalence
28 birthrate - abortions - russia - decreasing - ourworldindata 28 28_birthrate_abortions_russia_decreasing
29 illegals - texas - reynosa - border - mcallen 28 29_illegals_texas_reynosa_border
30 trump - fbi - pelosi - arrested - prosecution 28 30_trump_fbi_pelosi_arrested
31 livestream - 10pm - millenniyule - episodes - countdown 27 31_livestream_10pm_millenniyule_episodes
32 vaccinated - fauci - misinformation - australia - statins 26 32_vaccinated_fauci_misinformation_australia
33 psalm - micah - canaan - keepeth - hypocrites 26 33_psalm_micah_canaan_keepeth
34 millennials - zoomer - posterity - pessimists - mcmansion 26 34_millennials_zoomer_posterity_pessimists
35 prosecuted - rape - pakistanis - englishman - hatred 24 35_prosecuted_rape_pakistanis_englishman
36 insects - parasites - cannibalize - locust - eat 22 36_insects_parasites_cannibalize_locust
37 france - riots - hollande - arrested - migrants 22 37_france_riots_hollande_arrested
38 thanks - subscribers - youtube - shitpost - 2600登録者をありかとうこさいます 22 38_thanks_subscribers_youtube_shitpost
39 meteorologists - arkstorm - hoax - ozone - evacuate 21 39_meteorologists_arkstorm_hoax_ozone

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
4
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.