MARTINI_enrich_BERTopic_BBCisTheVIRUS
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_BBCisTheVIRUS")
topic_model.get_topic_info()
Topic overview
- Number of topics: 31
- Number of training documents: 5095
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | protest - bbc - truth - freedom - covid | 20 | -1_protest_bbc_truth_freedom |
0 | sticker - printable - sheets - a4 - canva | 3288 | 0_sticker_printable_sheets_a4 |
1 | bbcisthevirus - march - lies - nhk - 28th | 149 | 1_bbcisthevirus_march_lies_nhk |
2 | bbc - belfast - nottingham - wales - salford | 141 | 2_bbc_belfast_nottingham_wales |
3 | bbc - uncovered - bombshell - reporters - whores | 127 | 3_bbc_uncovered_bombshell_reporters |
4 | meetup - attending - idk - cancelled - date | 119 | 4_meetup_attending_idk_cancelled |
5 | arrested - cops - policeman - constables - idiots | 116 | 5_arrested_cops_policeman_constables |
6 | vigimedias - presse - propaganda - fevrier - souverainete | 107 | 6_vigimedias_presse_propaganda_fevrier |
7 | vaccinated - injectuon - mumps - nhs - colds | 87 | 7_vaccinated_injectuon_mumps_nhs |
8 | spammers - messages - deleted - admin - teligram | 80 | 8_spammers_messages_deleted_admin |
9 | manchester - posters - wednesday - 11am - picnic | 78 | 9_manchester_posters_wednesday_11am |
10 | deleted - links - messages - hmm - video | 68 | 10_deleted_links_messages_hmm |
11 | infighting - clown - normie - pointless - sh1t | 65 | 11_infighting_clown_normie_pointless |
12 | flyposting - posters - defacement - sticker - advertisers | 63 | 12_flyposting_posters_defacement_sticker |
13 | protesting - extremists - conformists - revolutions - attend | 58 | 13_protesting_extremists_conformists_revolutions |
14 | revolt - fighting - ennemie - droits - violente | 48 | 14_revolt_fighting_ennemie_droits |
15 | unvaxxed - astrazeneca - bleeding - 2021 - pneumonia | 48 | 15_unvaxxed_astrazeneca_bleeding_2021 |
16 | freeview - unlicensed - cibtractvwith - beatthebailiffs - refunded | 45 | 16_freeview_unlicensed_cibtractvwith_beatthebailiffs |
17 | vaxgenocide - vandals - cittadini - videosorveglianza - nazista | 42 | 17_vaxgenocide_vandals_cittadini_videosorveglianza |
18 | bristol - manchesterat - wolverhampton - attending - rhondda | 39 | 18_bristol_manchesterat_wolverhampton_attending |
19 | australians - nsw - rallys - mebourne - nbn | 38 | 19_australians_nsw_rallys_mebourne |
20 | appreciated - yesterday - participated - greatest - congratulate | 34 | 20_appreciated_yesterday_participated_greatest |
21 | upl_unioneperleliberta - difendiamo - tutto - nati_liberi_to_be_free - silenzio | 31 | 21_upl_unioneperleliberta_difendiamo_tutto_nati_liberi_to_be_free |
22 | stickers - whiteboard - lol - postcard - decoration | 29 | 22_stickers_whiteboard_lol_postcard |
23 | irishpatriots - antifa - diarmaid - belgium - sovereign | 29 | 23_irishpatriots_antifa_diarmaid_belgium |
24 | worldwidedemonstration - wwrforfreedom - nationwideralliesforfreedom - fnqfreedomalliancenetwork - tiananmen | 27 | 24_worldwidedemonstration_wwrforfreedom_nationwideralliesforfreedom_fnqfreedomalliancenetwork |
25 | cashless - spycoin - wallet - tracked - currency | 25 | 25_cashless_spycoin_wallet_tracked |
26 | bbcisthevirus - manchester - thevmedia - confirmed - blackburn | 25 | 26_bbcisthevirus_manchester_thevmedia_confirmed |
27 | worldwidedemonstration - scotlandtgr - holyrood - edinburg - liberties | 25 | 27_worldwidedemonstration_scotlandtgr_holyrood_edinburg |
28 | qrcode - barcodes - scanned - smartphone - supermarket | 22 | 28_qrcode_barcodes_scanned_smartphone |
29 | bbc - arrests - stickered - newcastle - misinformation | 22 | 29_bbc_arrests_stickered_newcastle |
Training hyperparameters
- calculate_probabilities: True
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
- zeroshot_min_similarity: 0.7
- zeroshot_topic_list: None
Framework versions
- Numpy: 1.26.4
- HDBSCAN: 0.8.40
- UMAP: 0.5.7
- Pandas: 2.2.3
- Scikit-Learn: 1.5.2
- Sentence-transformers: 3.3.1
- Transformers: 4.46.3
- Numba: 0.60.0
- Plotly: 5.24.1
- Python: 3.10.12
- Downloads last month
- 5
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.