kardosdrur
committed on
Upload folder using huggingface_hub
- README.md +27 -15
- model.joblib +2 -2
README.md
CHANGED
@@ -25,26 +25,38 @@ model.print_topics()
 The model is structured as follows:
 
 ```
-
-
-
+ClusteringTopicModel(clustering=KMeans(n_clusters=20),
+                     dimensionality_reduction=PCA(n_components=5),
+                     feature_importance='c-tf-idf',
+                     vectorizer=CountVectorizer(min_df=10,
+                                                stop_words='english'))
 ```
 
 ## Topics
 The topics discovered by the model are the following:
 
-| Topic ID | Highest Ranking |
-| - | - |
-| 0 |
-| 1 |
-| 2 |
-| 3 |
-| 4 |
-| 5 |
-| 6 |
-| 7 |
-| 8 |
-| 9 |
+| Topic ID | Highest Ranking |
+| - | - |
+| 0 | ax, max, g9v, b8f, jpeg, pl, a86, db, 1d9, file |
+| 1 | drive, scsi, price, card, sale, 00, shipping, ram, pc, offer |
+| 2 | pathetic, path, patient, patience, paths, pathology, patrick, patent, patently, patriot |
+| 3 | key, encryption, government, clipper, chip, keys, law, use, nsa, escrow |
+| 4 | people, right, don, think, just, government, like, say, does, rights |
+| 5 | game, team, year, 25, play, games, players, 10, 55, season |
+| 6 | dos, windows, image, file, edu, ftp, version, files, available, program |
+| 7 | god, jesus, bible, people, christ, believe, christians, christian, faith, say |
+| 8 | mr, president, people, fbi, gun, think, did, don, batf, know |
+| 9 | space, use, new, launch, used, like, don, know, just, 00 |
+| 10 | god, jews, people, church, does, did, christian, greek, just, israel |
+| 11 | car, just, like, don, people, think, money, insurance, make, time |
+| 12 | software, windows, thanks, know, version, does, ftp, available, xfree86, pc |
+| 13 | ax, edu, information, pub, space, ftp, data, mail, file, entry |
+| 14 | hockey, game, games, team, season, nhl, la, league, don, pts |
+| 15 | armenian, armenians, turkish, people, said, israel, jews, genocide, israeli, armenia |
+| 16 | 00, car, new, 50, price, bike, good, like, 1st, 10 |
+| 17 | like, just, time, problem, don, use, know, vitamin, good, think |
+| 18 | drive, scsi, card, disk, windows, controller, drives, use, bus, ide |
+| 19 | ax, max, edu, com, b8f, ah, 145, a86, pl, air |
 
 ## Package versions
 
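The README hunk above sets `feature_importance='c-tf-idf'` and lists the highest-ranking words per topic. As a rough illustration of how such a ranking can arise, here is a minimal pure-Python sketch of one common class-based TF-IDF formulation (term frequency within a cluster, scaled by `log(1 + average_tokens_per_cluster / total_term_frequency)`). The function names `c_tf_idf` and `top_terms` and the toy documents are hypothetical, and the library that produced this model may weight terms differently.

```python
import math
from collections import Counter


def c_tf_idf(cluster_docs):
    """Compute class-based TF-IDF weights (illustrative sketch only).

    cluster_docs maps a cluster id to a list of tokenized documents.
    Returns a dict mapping each cluster id to {term: weight}.
    """
    # Treat all documents in a cluster as one long document.
    counts = {
        cluster: Counter(token for doc in docs for token in doc)
        for cluster, docs in cluster_docs.items()
    }
    # Total frequency of every term across all clusters.
    total = Counter()
    for cluster_counts in counts.values():
        total.update(cluster_counts)
    # Average number of tokens per cluster.
    avg_tokens = sum(total.values()) / len(counts)

    weights = {}
    for cluster, cluster_counts in counts.items():
        n_tokens = sum(cluster_counts.values())
        weights[cluster] = {
            term: (freq / n_tokens) * math.log(1 + avg_tokens / total[term])
            for term, freq in cluster_counts.items()
        }
    return weights


def top_terms(weights, k=3):
    """Return the k highest-weighted terms per cluster (ties broken alphabetically)."""
    return {
        cluster: [
            term
            for term, _ in sorted(w.items(), key=lambda kv: (-kv[1], kv[0]))[:k]
        ]
        for cluster, w in weights.items()
    }


# Toy clusters (hypothetical data, loosely echoing the topic table).
toy_clusters = {
    0: [["drive", "scsi", "disk", "people"], ["drive", "card"]],
    1: [["god", "jesus", "bible", "people"], ["god", "faith"]],
}
toy_weights = c_tf_idf(toy_clusters)
toy_top = top_terms(toy_weights, k=1)
```

In this toy input, a word shared by both clusters (`people`) receives a lower weight than a word of equal frequency that is specific to one cluster (`scsi`), which is the behaviour that makes the per-topic word lists distinctive.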
model.joblib
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:7c44d3bc14662cd8548ed94cbb5294496d2d8c20874fb2f5189797b0a0fbce92
+size 139475171
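The `model.joblib` entry in the repository is a Git LFS pointer: a small text file recording the hash and byte size of the real artifact. A minimal sketch of how a downloaded file could be checked against such a pointer is shown below, using only the standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path

# Pointer contents as they appear on the "+" side of the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7c44d3bc14662cd8548ed94cbb5294496d2d8c20874fb2f5189797b0a0fbce92
size 139475171
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash the pointer records."""
    path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so large artifacts do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Assuming the artifact has been downloaded to a local `model.joblib`, `matches_pointer("model.joblib", pointer)` would report whether it is the file this commit refers to. Verifying the hash before loading is a sensible precaution, since joblib files are pickle-based and should only be loaded from trusted sources.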