MARTINI_enrich_BERTopic_SunflowerSociety
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_SunflowerSociety")
topic_model.get_topic_info()
Topic overview
- Number of topics: 41
- Number of training documents: 5874
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | covid - america - whites - christian - never | 20 | -1_covid_america_whites_christian |
0 | retweeted - musk - followers - libtards - shadowbanned | 3723 | 0_retweeted_musk_followers_libtards |
1 | openai - links - llm - 2023 - robots | 138 | 1_openai_links_llm_2023 |
2 | japan - shogunate - yamaguchi - kishida - ultranationalist | 120 | 2_japan_shogunate_yamaguchi_kishida |
3 | feminism - manosphere - mothers - marry - nana | 119 | 3_feminism_manosphere_mothers_marry |
4 | souls - poet - yonder - ruskin - stillness | 109 | 4_souls_poet_yonder_ruskin |
5 | photorealism - cyberpunk - midjourney - warzone - neon | 102 | 5_photorealism_cyberpunk_midjourney_warzone |
6 | jews - antisemitic - netanyahu - israelis - persecuted | 96 | 6_jews_antisemitic_netanyahu_israelis |
7 | ai - propaganda - humanity - imagination - centralized | 90 | 7_ai_propaganda_humanity_imagination |
8 | goddamn - apawgus - outsmarted - standups - shhh | 80 | 8_goddamn_apawgus_outsmarted_standups |
9 | testosterone - estrogenic - vitamin - sunscreens - metabolic | 79 | 9_testosterone_estrogenic_vitamin_sunscreens |
10 | mcluhan - modernity - decentralization - metaverse - trollgaze | 78 | 10_mcluhan_modernity_decentralization_metaverse |
11 | sunflowersocietymag - submissions - january - vanguardist - dunsany | 76 | 11_sunflowersocietymag_submissions_january_vanguardist |
12 | calvinists - baptists - puritan - kjv - catechism | 71 | 12_calvinists_baptists_puritan_kjv |
13 | dimeschild - dissident - mongols - latest - societies | 67 | 13_dimeschild_dissident_mongols_latest |
14 | trudeau - canadians - alberta - multiculturalism - condemns | 63 | 14_trudeau_canadians_alberta_multiculturalism |
15 | immigration - canadians - trudeau - newcomers - minister | 57 | 15_immigration_canadians_trudeau_newcomers |
16 | vaccinated - paxlovid - bivalent - boosters - reinfection | 56 | 16_vaccinated_paxlovid_bivalent_boosters |
17 | fomc - dedollarization - usinflationcalculator - hyperinflated - banknotes | 51 | 17_fomc_dedollarization_usinflationcalculator_hyperinflated |
18 | leftists - fascism - liberalism - brainwashed - transgenderism | 45 | 18_leftists_fascism_liberalism_brainwashed |
19 | transphobe - transgenders - dysphoria - lgbtqp - lobotomies | 43 | 19_transphobe_transgenders_dysphoria_lgbtqp |
20 | democrats - ballots - reaganism - gop - midterms | 43 | 20_democrats_ballots_reaganism_gop |
21 | zelensky - russia - war - prigozhin - tanks | 41 | 21_zelensky_russia_war_prigozhin |
22 | immigrants - majority - assimilation - demographic - multiracial | 40 | 22_immigrants_majority_assimilation_demographic |
23 | inbreeding - miscegenation - autosomal - phenotype - haplogroup | 39 | 23_inbreeding_miscegenation_autosomal_phenotype |
24 | bidenomics - joe - kamala - impeachment - announced | 36 | 24_bidenomics_joe_kamala_impeachment |
25 | shortages - electric - gasbuddy - lithium - euros | 32 | 25_shortages_electric_gasbuddy_lithium |
26 | illegals - tanaiste - westmeath - lampedusa - rwanda | 31 | 26_illegals_tanaiste_westmeath_lampedusa |
27 | monkeypox - gonorrhoea - aids - prevalence - homosexualists | 29 | 27_monkeypox_gonorrhoea_aids_prevalence |
28 | birthrate - abortions - russia - decreasing - ourworldindata | 28 | 28_birthrate_abortions_russia_decreasing |
29 | illegals - texas - reynosa - border - mcallen | 28 | 29_illegals_texas_reynosa_border |
30 | trump - fbi - pelosi - arrested - prosecution | 28 | 30_trump_fbi_pelosi_arrested |
31 | livestream - 10pm - millenniyule - episodes - countdown | 27 | 31_livestream_10pm_millenniyule_episodes |
32 | vaccinated - fauci - misinformation - australia - statins | 26 | 32_vaccinated_fauci_misinformation_australia |
33 | psalm - micah - canaan - keepeth - hypocrites | 26 | 33_psalm_micah_canaan_keepeth |
34 | millennials - zoomer - posterity - pessimists - mcmansion | 26 | 34_millennials_zoomer_posterity_pessimists |
35 | prosecuted - rape - pakistanis - englishman - hatred | 24 | 35_prosecuted_rape_pakistanis_englishman |
36 | insects - parasites - cannibalize - locust - eat | 22 | 36_insects_parasites_cannibalize_locust |
37 | france - riots - hollande - arrested - migrants | 22 | 37_france_riots_hollande_arrested |
38 | thanks - subscribers - youtube - shitpost - 2600登録者をありかとうこさいます | 22 | 38_thanks_subscribers_youtube_shitpost |
39 | meteorologists - arkstorm - hoax - ozone - evacuate | 21 | 39_meteorologists_arkstorm_hoax_ozone |
Training hyperparameters
- calculate_probabilities: True
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
- zeroshot_min_similarity: 0.7
- zeroshot_topic_list: None
Framework versions
- Numpy: 1.26.4
- HDBSCAN: 0.8.40
- UMAP: 0.5.7
- Pandas: 2.2.3
- Scikit-Learn: 1.5.2
- Sentence-transformers: 3.3.1
- Transformers: 4.46.3
- Numba: 0.60.0
- Plotly: 5.24.1
- Python: 3.10.12
- Downloads last month
- 4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.