MARTINI_enrich_BERTopic_GeorgeWebb

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

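from bertopic import BERTopic

# Download the fitted model from the Hugging Face Hub and load it.
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_GeorgeWebb")

# Return a DataFrame with one row per topic (ID, document count, label).
topic_model.get_topic_info()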
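
Beyond listing topics, a loaded model can be queried for the keywords of a single topic or used to assign topics to new text. The helpers below are a minimal sketch, assuming the standard BERTopic methods `get_topic` and `transform`; the function names `top_keywords` and `assign_topics` are illustrative and not part of this model card.

```python
def top_keywords(topic_model, topic_id, n=5):
    """Return up to n keywords for one topic, or [] if the topic is unknown."""
    # BERTopic.get_topic returns a list of (word, score) pairs,
    # or False when the topic ID does not exist.
    words = topic_model.get_topic(topic_id)
    if not words:
        return []
    return [word for word, _score in words[:n]]


def assign_topics(topic_model, docs):
    """Pair each document with the topic ID predicted for it (-1 = outlier)."""
    # BERTopic.transform returns (topic_ids, probabilities).
    topic_ids, _probs = topic_model.transform(docs)
    return [(doc, int(topic_id)) for doc, topic_id in zip(docs, topic_ids)]
```

With the model loaded as above, `top_keywords(topic_model, 3)` would list the leading keywords of topic 3, and `assign_topics(topic_model, ["some new text"])` would return the predicted topic ID for that text. Predicting on new text needs the underlying embedding model, which may trigger an additional download.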

Topic overview

  • Number of topics: 45
  • Number of training documents: 5241
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | remdesivir - pfizer - 2020 - substackcdn - ukraine | 20 | -1_remdesivir_pfizer_2020_substackcdn |
| 0 | housatoniclive - bitchute - youtube - fauci - quanrud | 3012 | 0_housatoniclive_bitchute_youtube_fauci |
| 1 | thebiden - blackberry - investigators - cooney - subpoena | 144 | 1_thebiden_blackberry_investigators_cooney |
| 2 | journos - wapo - tweet - liberal - sentinel | 137 | 2_journos_wapo_tweet_liberal |
| 3 | anthrax - rumsfeld - qanon - vaccine - bombings | 124 | 3_anthrax_rumsfeld_qanon_vaccine |
| 4 | malone - bannon - vaccine - robert - retweeting | 123 | 4_malone_bannon_vaccine_robert |
| 5 | trump - doj - kasowitz - unredacted - informant | 106 | 5_trump_doj_kasowitz_unredacted |
| 6 | webinar - weibo - live - newswoman - channel | 81 | 6_webinar_weibo_live_newswoman |
| 7 | informants - superpac - voting - conroe - hecker | 78 | 7_informants_superpac_voting_conroe |
| 8 | tracy - operating - youtube - cuomo - 3mnbwxudjff9uokvr5ysj1sovv8fjr8nqw | 78 | 8_tracy_operating_youtube_cuomo |
| 9 | wuhan - corinavirus - blackwater - superspreader - september | 73 | 9_wuhan_corinavirus_blackwater_superspreader |
| 10 | nazis - kissinger - schmidt - davos - swiss | 68 | 10_nazis_kissinger_schmidt_davos |
| 11 | h5n1 - fouchier - sars - transmissibility - xmrvs | 68 | 11_h5n1_fouchier_sars_transmissibility |
| 12 | mcdufflivestwo - oloughlin - thelaroucheorganization - neighborhoodnewsstudio - nixon | 64 | 12_mcdufflivestwo_oloughlin_thelaroucheorganization_neighborhoodnewsstudio |
| 13 | webbs - livestream - rumble - joined - dissidents | 59 | 13_webbs_livestream_rumble_joined |
| 14 | neighborhoodnewsstudio - corona - bitchute - websites - channels | 56 | 14_neighborhoodnewsstudio_corona_bitchute_websites |
| 15 | dnc - substackcdn - kissinger - orange - anthrax | 56 | 15_dnc_substackcdn_kissinger_orange |
| 16 | coronavirus - wuhan - cnn - virginia - 2020 | 56 | 16_coronavirus_wuhan_cnn_virginia |
| 17 | nordstream - kherson - kissinger - popodopoulos - venetian | 47 | 17_nordstream_kherson_kissinger_popodopoulos |
| 18 | bioweapons - ukraine - kirillov - putin - pentagon | 45 | 18_bioweapons_ukraine_kirillov_putin |
| 19 | mcdufflivestwo - mindfields - dictators - yandell - thornton | 45 | 19_mcdufflivestwo_mindfields_dictators_yandell |
| 20 | azov - putin - kosovo - hostage - mujahideen | 43 | 20_azov_putin_kosovo_hostage |
| 21 | mcdufflivestwo - wednesday - underdown - youtube - housatonicits | 41 | 21_mcdufflivestwo_wednesday_underdown_youtube |
| 22 | cnn - jeff - vigilante - smeared - misrepresentations | 39 | 22_cnn_jeff_vigilante_smeared |
| 23 | coronavirusmcnutt - wuhan - webinar - huff - zoonotic | 33 | 23_coronavirusmcnutt_wuhan_webinar_huff |
| 24 | fbi - informants - milley - arrested - january | 31 | 24_fbi_informants_milley_arrested |
| 25 | news - nnhour - brandnewtube - bitchute - adds | 31 | 25_news_nnhour_brandnewtube_bitchute |
| 26 | electrocardiogram - myocarditis - mcg - tetralogy - screened | 31 | 26_electrocardiogram_myocarditis_mcg_tetralogy |
| 27 | remdesivir - hydroxychloroquine - tamiflu - alabama - famotadine | 31 | 27_remdesivir_hydroxychloroquine_tamiflu_alabama |
| 28 | malone - robert - ivermectin - vaccine - darpa | 31 | 28_malone_robert_ivermectin_vaccine |
| 29 | dealey - lbj - kennedy - connally - texas | 31 | 29_dealey_lbj_kennedy_connally |
| 30 | petraeus - clinton - kosovo - covert - 2fbucketeer | 30 | 30_petraeus_clinton_kosovo_covert |
| 31 | mcdufflivestwo - chat - youtube - fogarty - 2022 | 30 | 31_mcdufflivestwo_chat_youtube_fogarty |
| 32 | pfizerorgchart - whistleblower - jordan - mckenzie - oswaldo | 29 | 32_pfizerorgchart_whistleblower_jordan_mckenzie |
| 33 | virologist - benassi - stuxnet - nanjing - nanoferretin | 28 | 33_virologist_benassi_stuxnet_nanjing |
| 34 | transnistria - ukrainians - skynews - krakow - nawaz | 28 | 34_transnistria_ukrainians_skynews_krakow |
| 35 | corona - live - telegram - sweigert - darpa | 26 | 35_corona_live_telegram_sweigert |
| 36 | giuseppe - pope - libya - spying - tiabbi | 26 | 36_giuseppe_pope_libya_spying |
| 37 | hpv - turbocancer - telomerase - hauptman - p53s | 26 | 37_hpv_turbocancer_telomerase_hauptman |
| 38 | pfizerorg - glaxo - remdesivir - kellor - whitepaper | 24 | 38_pfizerorg_glaxo_remdesivir_kellor |
| 39 | russiagate - doxxing - ssassin - suppressed - hale | 24 | 39_russiagate_doxxing_ssassin_suppressed |
| 40 | mcduff - neighborhoodnewsstudio - hamamamoto - youtube - screamingospreys | 24 | 40_mcduff_neighborhoodnewsstudio_hamamamoto_youtube |
| 41 | zelensky - mariupol - oligarchs - kolomoisky - billion | 22 | 41_zelensky_mariupol_oligarchs_kolomoisky |
| 42 | journos - webinar - peter - duke - monaghan | 21 | 42_journos_webinar_peter_duke |
| 43 | skripal - gubarev - steele - manafort - dossier | 21 | 43_skripal_gubarev_steele_manafort |
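
Each label in the table above is the topic ID followed by the topic's first four keywords, joined with underscores, and each frequency is a count of training documents. The sketch below shows one way to split a label back into its parts and to express a frequency as a share of the 5241 training documents. The helper names `parse_label` and `topic_share` are illustrative, and only the first three rows are copied in to keep the example short.

```python
TRAINING_DOCS = 5241  # "Number of training documents" from the overview

# (label, frequency) pairs copied from the first rows of the topic table.
ROWS = [
    ("-1_remdesivir_pfizer_2020_substackcdn", 20),
    ("0_housatoniclive_bitchute_youtube_fauci", 3012),
    ("1_thebiden_blackberry_investigators_cooney", 144),
]


def parse_label(label):
    """Split '3_anthrax_rumsfeld_qanon_vaccine' into (3, ['anthrax', ...])."""
    topic_id, *keywords = label.split("_")
    return int(topic_id), keywords


def topic_share(frequency, total=TRAINING_DOCS):
    """Fraction of the training documents assigned to a topic."""
    return frequency / total


for label, frequency in ROWS:
    topic_id, keywords = parse_label(label)
    print(f"topic {topic_id:>2}: {topic_share(frequency):6.1%}  {', '.join(keywords)}")
```

Read this way, topic 0 alone accounts for 3012 of the 5241 training documents, a little over half, while only 20 documents fall into the outlier topic -1.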

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
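
The hyperparameters listed above correspond to keyword arguments of the `BERTopic` constructor. The following is a sketch of how they could be passed to a fresh, untrained model, assuming a BERTopic version recent enough to accept the zero-shot arguments. It does not reproduce this model: the embedding model and the UMAP and HDBSCAN settings used for training are not listed in this card, so library defaults would apply. The names `HYPERPARAMS` and `build_untrained_model` are illustrative.

```python
# Values copied from the "Training hyperparameters" list.
HYPERPARAMS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": False,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_untrained_model(**overrides):
    """Create an unfitted BERTopic instance with the listed settings."""
    # Imported lazily so the dictionary above is usable without bertopic.
    from bertopic import BERTopic

    return BERTopic(**{**HYPERPARAMS, **overrides})
```

Calling `build_untrained_model(verbose=True)` would return a model ready for `fit_transform` on your own documents, with progress logging switched on.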

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
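
If loading the model fails or behaves unexpectedly, comparing the local environment against the versions listed above can help narrow down the cause. The sketch below uses only the standard library. The PyPI distribution names (for example `umap-learn` for UMAP) are filled in here and are not taken from this card, and `check_versions` is an illustrative helper name.

```python
from importlib import metadata

# Versions from the "Framework versions" list, keyed by PyPI distribution name.
PINNED = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.40",
    "umap-learn": "0.5.7",
    "pandas": "2.2.3",
    "scikit-learn": "1.5.2",
    "sentence-transformers": "3.3.1",
    "transformers": "4.46.3",
    "numba": "0.60.0",
    "plotly": "5.24.1",
}


def check_versions(pinned=PINNED):
    """Map each package to (expected_version, installed_version_or_None)."""
    report = {}
    for package, expected in pinned.items():
        try:
            installed = metadata.version(package)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[package] = (expected, installed)
    return report


for package, (expected, installed) in check_versions().items():
    status = "ok" if installed == expected else "differs"
    print(f"{package:<22} expected {expected:<8} installed {installed} [{status}]")
```

A version mismatch does not necessarily mean the model will fail to load; the report is only a starting point for troubleshooting.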