MARTINI_enrich_BERTopic_GeorgeWebb
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_GeorgeWebb")
topic_model.get_topic_info()
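Beyond `get_topic_info()`, a single topic can be inspected with `get_topic`, which in BERTopic returns a list of `(word, score)` pairs for a known topic and `False` otherwise. The following is a minimal sketch under that assumption; `top_keywords` is a hypothetical helper name used for illustration and is not part of BERTopic.

```python
def top_keywords(topic_model, topic_id, n=5):
    """Return up to n keywords for topic_id, or [] if the topic is unknown.

    Illustrative helper (not a BERTopic function). It relies only on
    topic_model.get_topic(topic_id) returning (word, score) pairs.
    """
    words = topic_model.get_topic(topic_id)
    if not words:  # get_topic returns False for unknown topic ids
        return []
    return [word for word, _score in words[:n]]


# Usage with the model loaded above (needs bertopic and network access):
# topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_GeorgeWebb")
# print(top_keywords(topic_model, 3))
```

The helper only touches `get_topic`, so it can be tried against any fitted BERTopic model, not just this one.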
Topic overview
- Number of topics: 45
- Number of training documents: 5241
Overview of all topics:
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | remdesivir - pfizer - 2020 - substackcdn - ukraine | 20 | -1_remdesivir_pfizer_2020_substackcdn |
0 | housatoniclive - bitchute - youtube - fauci - quanrud | 3012 | 0_housatoniclive_bitchute_youtube_fauci |
1 | thebiden - blackberry - investigators - cooney - subpoena | 144 | 1_thebiden_blackberry_investigators_cooney |
2 | journos - wapo - tweet - liberal - sentinel | 137 | 2_journos_wapo_tweet_liberal |
3 | anthrax - rumsfeld - qanon - vaccine - bombings | 124 | 3_anthrax_rumsfeld_qanon_vaccine |
4 | malone - bannon - vaccine - robert - retweeting | 123 | 4_malone_bannon_vaccine_robert |
5 | trump - doj - kasowitz - unredacted - informant | 106 | 5_trump_doj_kasowitz_unredacted |
6 | webinar - weibo - live - newswoman - channel | 81 | 6_webinar_weibo_live_newswoman |
7 | informants - superpac - voting - conroe - hecker | 78 | 7_informants_superpac_voting_conroe |
8 | tracy - operating - youtube - cuomo - 3mnbwxudjff9uokvr5ysj1sovv8fjr8nqw | 78 | 8_tracy_operating_youtube_cuomo |
9 | wuhan - corinavirus - blackwater - superspreader - september | 73 | 9_wuhan_corinavirus_blackwater_superspreader |
10 | nazis - kissinger - schmidt - davos - swiss | 68 | 10_nazis_kissinger_schmidt_davos |
11 | h5n1 - fouchier - sars - transmissibility - xmrvs | 68 | 11_h5n1_fouchier_sars_transmissibility |
12 | mcdufflivestwo - oloughlin - thelaroucheorganization - neighborhoodnewsstudio - nixon | 64 | 12_mcdufflivestwo_oloughlin_thelaroucheorganization_neighborhoodnewsstudio |
13 | webbs - livestream - rumble - joined - dissidents | 59 | 13_webbs_livestream_rumble_joined |
14 | neighborhoodnewsstudio - corona - bitchute - websites - channels | 56 | 14_neighborhoodnewsstudio_corona_bitchute_websites |
15 | dnc - substackcdn - kissinger - orange - anthrax | 56 | 15_dnc_substackcdn_kissinger_orange |
16 | coronavirus - wuhan - cnn - virginia - 2020 | 56 | 16_coronavirus_wuhan_cnn_virginia |
17 | nordstream - kherson - kissinger - popodopoulos - venetian | 47 | 17_nordstream_kherson_kissinger_popodopoulos |
18 | bioweapons - ukraine - kirillov - putin - pentagon | 45 | 18_bioweapons_ukraine_kirillov_putin |
19 | mcdufflivestwo - mindfields - dictators - yandell - thornton | 45 | 19_mcdufflivestwo_mindfields_dictators_yandell |
20 | azov - putin - kosovo - hostage - mujahideen | 43 | 20_azov_putin_kosovo_hostage |
21 | mcdufflivestwo - wednesday - underdown - youtube - housatonicits | 41 | 21_mcdufflivestwo_wednesday_underdown_youtube |
22 | cnn - jeff - vigilante - smeared - misrepresentations | 39 | 22_cnn_jeff_vigilante_smeared |
23 | coronavirusmcnutt - wuhan - webinar - huff - zoonotic | 33 | 23_coronavirusmcnutt_wuhan_webinar_huff |
24 | fbi - informants - milley - arrested - january | 31 | 24_fbi_informants_milley_arrested |
25 | news - nnhour - brandnewtube - bitchute - adds | 31 | 25_news_nnhour_brandnewtube_bitchute |
26 | electrocardiogram - myocarditis - mcg - tetralogy - screened | 31 | 26_electrocardiogram_myocarditis_mcg_tetralogy |
27 | remdesivir - hydroxychloroquine - tamiflu - alabama - famotadine | 31 | 27_remdesivir_hydroxychloroquine_tamiflu_alabama |
28 | malone - robert - ivermectin - vaccine - darpa | 31 | 28_malone_robert_ivermectin_vaccine |
29 | dealey - lbj - kennedy - connally - texas | 31 | 29_dealey_lbj_kennedy_connally |
30 | petraeus - clinton - kosovo - covert - 2fbucketeer | 30 | 30_petraeus_clinton_kosovo_covert |
31 | mcdufflivestwo - chat - youtube - fogarty - 2022 | 30 | 31_mcdufflivestwo_chat_youtube_fogarty |
32 | pfizerorgchart - whistleblower - jordan - mckenzie - oswaldo | 29 | 32_pfizerorgchart_whistleblower_jordan_mckenzie |
33 | virologist - benassi - stuxnet - nanjing - nanoferretin | 28 | 33_virologist_benassi_stuxnet_nanjing |
34 | transnistria - ukrainians - skynews - krakow - nawaz | 28 | 34_transnistria_ukrainians_skynews_krakow |
35 | corona - live - telegram - sweigert - darpa | 26 | 35_corona_live_telegram_sweigert |
36 | giuseppe - pope - libya - spying - tiabbi | 26 | 36_giuseppe_pope_libya_spying |
37 | hpv - turbocancer - telomerase - hauptman - p53s | 26 | 37_hpv_turbocancer_telomerase_hauptman |
38 | pfizerorg - glaxo - remdesivir - kellor - whitepaper | 24 | 38_pfizerorg_glaxo_remdesivir_kellor |
39 | russiagate - doxxing - ssassin - suppressed - hale | 24 | 39_russiagate_doxxing_ssassin_suppressed |
40 | mcduff - neighborhoodnewsstudio - hamamamoto - youtube - screamingospreys | 24 | 40_mcduff_neighborhoodnewsstudio_hamamamoto_youtube |
41 | zelensky - mariupol - oligarchs - kolomoisky - billion | 22 | 41_zelensky_mariupol_oligarchs_kolomoisky |
42 | journos - webinar - peter - duke - monaghan | 21 | 42_journos_webinar_peter_duke |
43 | skripal - gubarev - steele - manafort - dossier | 21 | 43_skripal_gubarev_steele_manafort |
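
The labels in the table appear to follow the pattern `<topic id>_<first four keywords joined by underscores>`, and the frequencies sum to the 5241 training documents. As a small sketch under that assumption, a label can be split back into its parts and a frequency converted into a share of the corpus. `parse_label` and `share` are hypothetical helper names, not part of BERTopic.

```python
def parse_label(label):
    """Split a label such as '3_anthrax_rumsfeld_qanon_vaccine' into (id, keywords).

    Assumes the id comes first and that keywords contain no underscores,
    which holds for the labels shown in the table.
    """
    topic_id, _, rest = label.partition("_")
    return int(topic_id), rest.split("_")


def share(frequency, total_documents=5241):
    """Fraction of the training documents assigned to a topic."""
    return frequency / total_documents


topic_id, keywords = parse_label("-1_remdesivir_pfizer_2020_substackcdn")
print(topic_id, keywords)
print(f"Topic 0 share of documents: {share(3012):.1%}")
```

Topic 0 alone accounts for more than half of the training documents, which is worth keeping in mind when comparing topic frequencies.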
Training hyperparameters
- calculate_probabilities: True
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
- zeroshot_min_similarity: 0.7
- zeroshot_topic_list: None
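
For reference, the settings listed above correspond to keyword arguments of the `BERTopic` constructor in recent BERTopic releases. The sketch below collects them in a dictionary and shows how an unfitted model might be created from them. The embedding model used for training is not stated in this card, so `embedding_model` is left as a placeholder argument, and `build_untrained_model` is a hypothetical helper name.

```python
# Values copied from the "Training hyperparameters" list.
HYPERPARAMETERS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": False,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_untrained_model(embedding_model=None):
    """Create an unfitted BERTopic instance with the listed settings.

    embedding_model is an assumption: the card does not say which
    embedding model was used, so the caller has to supply one.
    """
    # Imported lazily so the dictionary above is usable without bertopic.
    from bertopic import BERTopic

    return BERTopic(embedding_model=embedding_model, **HYPERPARAMETERS)
```

This does not reproduce the published topics: the training documents, the embedding model, and the UMAP and HDBSCAN settings are not part of the list.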
Framework versions
- Numpy: 1.26.4
- HDBSCAN: 0.8.40
- UMAP: 0.5.7
- Pandas: 2.2.3
- Scikit-Learn: 1.5.2
- Sentence-transformers: 3.3.1
- Transformers: 4.46.3
- Numba: 0.60.0
- Plotly: 5.24.1
- Python: 3.10.12