kardosdrur commited on
Commit
0854211
·
verified ·
1 Parent(s): 778b381

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. README.md +27 -15
  2. model.joblib +2 -2
README.md CHANGED
@@ -25,26 +25,38 @@ model.print_topics()
25
  The model is structured as follows:
26
 
27
  ```
28
- SemanticSignalSeparation(decomposition=FastICA(n_components=10),
29
- vectorizer=CountVectorizer(min_df=10,
30
- stop_words='english'))
 
 
31
  ```
32
 
33
  ## Topics
34
  The topics discovered by the model are the following:
35
 
36
- | Topic ID | Highest Ranking | Lowest Ranking |
37
- | - | - | - |
38
- | 0 | goaltenders, nhl, bullpen, sabres, goaltender, puckett, leafs, braves, pitchers, canucks | accelerator, malaysia, automobile, accelerators, mazda, automobiles, automotive, vehicle, silicon, britain |
39
- | 1 | saturn, suzuki, symptoms, jupiter, bmw, exhaust, volvo, engine, mazda, propulsion | wiretapping, wiretaps, nsa, spying, eavesdropping, wiretap, security, encryption, enforcement, safeguarding |
40
- | 2 | drawbacks, advantages, productivity, efficiency, innovation, economical, disadvantages, proponents, competitiveness, economically | address, instructions, serial, arrived, codes, configured, 9591, 16550, contacting, recieved |
41
- | 3 | publishes, archives, publisher, scholars, manuscripts, npr, affiliated, revelations, discusses, archive | motorcycling, motorcycles, speeding, motorcycle, driving, motorcyclist, riding, harleys, braking, vehicles |
42
- | 4 | motherboard, ram, motherboards, processor, cmos, hardware, chipset, chipsets, amd, mb | yale, sunroof, damphousse, library, npr, billboards, balloon, schools, kerosene, nicholas |
43
- | 5 | palestinians, palestinian, gazans, gaza, genocide, israelis, atrocities, israeli, hamas, holocaust | motorola, mastercard, technician, smartdrive, telephony, transmissions, phones, electronically, voyager, cruising |
44
- | 6 | spectrometer, makefile, biochemistry, dblspace, bibliography, booklet, bookstores, circumference, nutritional, statistically | uh, um, em, yeah, oh, er, ah, yer, yo, ye |
45
- | 7 | theology, theological, scripture, theologians, christianity, biblical, agnosticism, devout, agnostic, christians | missiles, munitions, soviets, artillery, bunker, missile, explosives, tactical, grenades, soviet |
46
- | 8 | causes, metabolism, obstruction, bugging, xsession, disabling, debugger, behaviour, syndrome, occurs | prices, pricing, price, affordable, cheap, forsale, inexpensive, cost, purchases, priced |
47
- | 9 | xcreatewindow, programmable, bitmap, bitmaps, colormaps, freeware, gui, imagewriter, colormap, adobe | discrepancy, inaccuracies, defective, debacle, unrecognized, faulty, misconception, sceptical, refutation, warranted |
 
 
 
 
 
 
 
 
 
 
48
 
49
  ## Package versions
50
 
 
25
  The model is structured as follows:
26
 
27
  ```
28
+ ClusteringTopicModel(clustering=KMeans(n_clusters=20),
29
+ dimensionality_reduction=PCA(n_components=5),
30
+ feature_importance='c-tf-idf',
31
+ vectorizer=CountVectorizer(min_df=10,
32
+ stop_words='english'))
33
  ```
34
 
35
  ## Topics
36
  The topics discovered by the model are the following:
37
 
38
+ | Topic ID | Highest Ranking |
39
+ | - | - |
40
+ | 0 | ax, max, g9v, b8f, jpeg, pl, a86, db, 1d9, file |
41
+ | 1 | drive, scsi, price, card, sale, 00, shipping, ram, pc, offer |
42
+ | 2 | pathetic, path, patient, patience, paths, pathology, patrick, patent, patently, patriot |
43
+ | 3 | key, encryption, government, clipper, chip, keys, law, use, nsa, escrow |
44
+ | 4 | people, right, don, think, just, government, like, say, does, rights |
45
+ | 5 | game, team, year, 25, play, games, players, 10, 55, season |
46
+ | 6 | dos, windows, image, file, edu, ftp, version, files, available, program |
47
+ | 7 | god, jesus, bible, people, christ, believe, christians, christian, faith, say |
48
+ | 8 | mr, president, people, fbi, gun, think, did, don, batf, know |
49
+ | 9 | space, use, new, launch, used, like, don, know, just, 00 |
50
+ | 10 | god, jews, people, church, does, did, christian, greek, just, israel |
51
+ | 11 | car, just, like, don, people, think, money, insurance, make, time |
52
+ | 12 | software, windows, thanks, know, version, does, ftp, available, xfree86, pc |
53
+ | 13 | ax, edu, information, pub, space, ftp, data, mail, file, entry |
54
+ | 14 | hockey, game, games, team, season, nhl, la, league, don, pts |
55
+ | 15 | armenian, armenians, turkish, people, said, israel, jews, genocide, israeli, armenia |
56
+ | 16 | 00, car, new, 50, price, bike, good, like, 1st, 10 |
57
+ | 17 | like, just, time, problem, don, use, know, vitamin, good, think |
58
+ | 18 | drive, scsi, card, disk, windows, controller, drives, use, bus, ide |
59
+ | 19 | ax, max, edu, com, b8f, ah, 145, a86, pl, air |
60
 
61
  ## Package versions
62
 
model.joblib CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:414ec7e8a0e127ce291f85425722f0df7219e16eb236a972dd5314e6d2cdb2f8
3
- size 145878291
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c44d3bc14662cd8548ed94cbb5294496d2d8c20874fb2f5189797b0a0fbce92
3
+ size 139475171