TopicModelling / README.md
heyitskim1912's picture
Add BERTopic model
1314de6
|
raw
history blame
6.57 kB
---
tags:
- bertopic
library_name: bertopic
pipeline_tag: text-classification
---
# TopicModelling
This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
## Usage
To use this model, please install BERTopic:
```
pip install -U bertopic
```
You can use the model as follows:
```python
from bertopic import BERTopic
topic_model = BERTopic.load("heyitskim1912/TopicModelling")
topic_model.get_topic_info()
```
## Topic overview
* Number of topics: 36
* Number of training documents: 1845
<details>
<summary>Click here for an overview of all topics.</summary>
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | growth rate - immersive - subscriber - future - long term | 10 | Outliers |
| 0 | expense growth - quarter versus - operating margin - lower expected - headwind | 564 | Decreased Operating Income and Higher Expenses |
| 1 | going forward - pricing actions - weâ ve - latin america - subscriber base | 153 | Cost Drivers and Pricing Strategy |
| 2 | distributed computing - process automation - windows 11 - power platform - active users | 146 | Digital Transformation |
| 3 | strong prior - continued strength - benefiting - revenue increased - driven growth | 128 | Revenue Growth and Performance |
| 4 | revenue growth - growth cable - growth sequentially - operating results - increased versus | 123 | Operating Results and Revenue Performance |
| 5 | growth strategy - profitability - efficiencies - resilience - significant progress | 66 | Resilience and Growth Strategy in Challenging Times |
| 6 | innovate - diversify - expanding opportunity - leveraging - customization | 50 | Driving Growth through Innovation and Expansion |
| 7 | personalizing experiences - transformative - experiences make - unparalleled - human connection | 48 | Transforming Entertainment through Unparalleled Storytelling |
| 8 | growth continued - strong job - demand strong - revenue growth - momentum microsoft | 45 | Sustained Growth and Engagement in Gaming and Office Consumer |
| 9 | guidance provided - year 2022 - integration costs - reopening - based current | 31 | Forward-looking Statements and Cautionary Statements |
| 10 | unwavering - reinvention - human connection - strategic investments - leadership team | 29 | Commitment to Reinvention and Partner Support |
| 11 | disneyland paris - entire quarter - including shanghai - navigating - limited number | 27 | Impact of Pandemic on Disney Theme Parks |
| 12 | staffing issues - shutdowns china - quarter diluted - onset pandemic - impacting | 26 | Supply Chain Disruptions and Inflationary Costs during Omicron Variant |
| 13 | strong demand - really strong - growth driving - great content - opportunity market | 26 | Strong Demand and Increased Per Capita Spending |
| 14 | improvement cloud - increased slightly - improvements azure - margin businesses - revenue mix | 25 | Improved Gross Margin Percentage in Cloud Services |
| 15 | star wars - pixar - jungle cruise - franchises including - disney day | 25 | Upcoming Content Expansion and Exciting Releases |
| 16 | consistently strong - continued growth - strong revenue - strong quarter - stellar performance | 25 | Revenue Growth and Performance |
| 17 | optimize - believe prudent - singularly focused - enhancements - executing reinvention | 24 | Commitment to Reinvention and Partner Support |
| 18 | customer experience - consumer products - iced coffee - coffee innovation - selling iced | 24 | Digital Engagement |
| 19 | strong food - diverse customer - strategically manage - growth digital - marketing solutions | 21 | Audience Engagement |
| 20 | espn advertising - continue perform - entertainment titles - nba finals - general entertainment | 20 | Sports Broadcasting and Streaming Success |
| 21 | innovation driving - growth continued - drive healthy - advanced security - demand microsoft | 19 | Microsoft Windows Commercial Products and Cloud Services Growth |
| 22 | revenue growth - segment revenue - enterprise services - productivity business - personal computing | 18 | Microsoft Revenue Outlook by Business Segments |
| 23 | customizing - experience possible - connect enables - feel good - loyalty program | 18 | Digital Rewards and Enhanced Customer Experience |
| 24 | strong annuity - significant growth - strong execution - demand strong - increased commitment | 17 | Microsoft Windows Commercial Products and Cloud Services Growth |
| 25 | driven growth - rewards program - growth asia - new stores - business q4 | 17 | Expansion in China |
| 26 | million subscriptions - million subscribers - sales mix - paid subscribers - q1 results | 16 | Continuous Growth of Subscriptions and ESPN+ |
| 27 | intensify - remain stable - stronger expected - fy2022 - anticipated half | 14 | Continuous Growth of Subscriptions and ESPN+ |
| 28 | increased slightly - increased constant - income increased - increased 22 - margin dollars | 14 | Significant Growth in Operating Income and Expenses |
| 29 | resonating - windows 11 - disneyland paris - alluded - advertisers | 14 | Successful Launch of Disney+ Ad-Supported Subscription |
| 30 | growing demand - increase versus - support growing - simplifying - fiscal 2023 | 13 | Investment in Cloud Services to Meet Growing Demand |
| 31 | experiences make - accretive business - integrations teams - connect enables - metrics | 13 | Reinventing Retail Partner Experience for Growth |
| 32 | excellence innovation - strong momentum - innovation audience - storytelling excellence - leading benefits | 13 | Driving Performance and Innovation through Strategic Investments |
| 33 | metrics - non gaap - currency dollars - fiscal 2022 - constant currency | 12 | Currency-Based Outlook for Performance Forecasting |
| 34 | robust demand - strong performance - driven growth - growth global - revenue increased | 11 | Strong Revenue Growth in Channel Development |
</details>
## Training hyperparameters
* calculate_probabilities: True
* language: None
* low_memory: False
* min_topic_size: 10
* n_gram_range: (1, 1)
* nr_topics: None
* seed_topic_list: None
* top_n_words: 10
* verbose: True
## Framework versions
* Numpy: 1.22.4
* HDBSCAN: 0.8.29
* UMAP: 0.5.3
* Pandas: 1.5.3
* Scikit-Learn: 1.2.2
* Sentence-transformers: 2.2.2
* Transformers: 4.30.2
* Numba: 0.56.4
* Plotly: 5.13.1
* Python: 3.10.12