wwydmanski committed on
Commit
cb09e22
1 Parent(s): 49570a9

Upload folder using huggingface_hub

Files changed (2)
  1. README.md +109 -144
  2. model.safetensors +1 -1
README.md CHANGED
@@ -7,39 +7,43 @@ tags:
7
  - sentence-similarity
8
  - feature-extraction
9
  - generated_from_trainer
10
- - dataset_size:9988
11
  - loss:MultipleNegativesRankingLoss
12
  widget:
13
- - source_sentence: Splenomegaly in Malta fever
14
  sentences:
15
- - 'TROPICAL SPLENOMEGALY. '
16
- - '[Voluminous migrating spleen in the course of Malta fever: effects of splenectomy]. '
17
- - '[Adenoma of appendix]. '
18
- - source_sentence: sRNA regulation
 
 
19
  sentences:
20
- - 'SR proteins control a complex network of RNA-processing events. '
21
- - 'Convergence of submodality-specific input onto neurons in primary somatosensory
22
- cortex. '
23
- - 'Dynamic features of gene expression control by small regulatory RNAs. '
24
- - source_sentence: Foley catheter hysterosalpingography
25
  sentences:
26
- - 'Hysterosalpingography using a Foley catheter. '
27
- - '[Long-term follow-up of adult patients with isolated congenital AV block]. '
28
- - 'Hysterosalpingography. '
29
- - source_sentence: Anti-endoglin monoclonal antibodies
30
  sentences:
31
- - 'Cortisol response to general anaesthesia for medical imaging in children. '
32
- - 'Anti-endoglin monoclonal antibodies are effective for suppressing metastasis
33
- and the primary tumors by targeting tumor vasculature. '
34
- - 'Endoglin: Beyond the Endothelium. '
35
- - source_sentence: Alternariol Methyl Ether Quantitation
36
  sentences:
37
- - 'Stable isotope dilution assays of alternariol and alternariol monomethyl ether
38
- in beverages. '
39
- - 'The roles of eotaxin and the STAT6 signalling pathway in eosinophil recruitment
40
- and host resistance to the nematodes Nippostrongylus brasiliensis and Heligmosomoides
41
- bakeri. '
42
- - 'Mechanisms of Action and Toxicity of the Mycotoxin Alternariol: A Review. '
43
  ---
44
 
45
  # SentenceTransformer based on allenai/specter2_base
@@ -92,9 +96,9 @@ from sentence_transformers import SentenceTransformer
92
  model = SentenceTransformer("sentence_transformers_model_id")
93
  # Run inference
94
  sentences = [
95
- 'Alternariol Methyl Ether Quantitation',
96
- 'Stable isotope dilution assays of alternariol and alternariol monomethyl ether in beverages. ',
97
- 'Mechanisms of Action and Toxicity of the Mycotoxin Alternariol: A Review. ',
98
  ]
99
  embeddings = model.encode(sentences)
100
  print(embeddings.shape)
@@ -149,19 +153,19 @@ You can finetune this model on your own dataset.
149
  #### json
150
 
151
  * Dataset: json
152
- * Size: 9,988 training samples
153
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
154
  * Approximate statistics based on the first 1000 samples:
155
  | | anchor | positive | negative |
156
  |:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
157
  | type | string | string | string |
158
- | details | <ul><li>min: 4 tokens</li><li>mean: 7.66 tokens</li><li>max: 34 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 19.05 tokens</li><li>max: 42 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 11.84 tokens</li><li>max: 48 tokens</li></ul> |
159
  * Samples:
160
- | anchor | positive | negative |
161
- |:-----------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------|
162
- | <code>mechanotransduction pathways</code> | <code>Signalling cascades in mechanotransduction: cell-matrix interactions and mechanical loading. </code> | <code>Mechanotransduction: May the force be with you. </code> |
163
- | <code>FSR-tunable comb filter</code> | <code>Multiwavelength Raman fiber laser with a continuously-tunable spacing. </code> | <code>Tunable multiwavelength fiber laser using a comb filter based on erbium-ytterbium co-doped polarization maintaining fiber loop mirror. </code> |
164
- | <code>Radiation pneumonitis enhancement</code> | <code>Induction and concurrent taxanes enhance both the pulmonary metabolic radiation response and the radiation pneumonitis response in patients with esophagus cancer. </code> | <code>Imaging of Hypersensitivity Pneumonitis. </code> |
165
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
166
  ```json
167
  {
@@ -301,117 +305,78 @@ You can finetune this model on your own dataset.
301
  </details>
302
 
303
  ### Training Logs
304
- <details><summary>Click to expand</summary>
305
-
306
  | Epoch | Step | Training Loss |
307
  |:------:|:----:|:-------------:|
308
- | 0.0095 | 1 | 2.9432 |
309
- | 0.0190 | 2 | 3.0121 |
310
- | 0.0286 | 3 | 2.9051 |
311
- | 0.0381 | 4 | 2.7906 |
312
- | 0.0476 | 5 | 2.6592 |
313
- | 0.0571 | 6 | 2.2835 |
314
- | 0.0667 | 7 | 2.1373 |
315
- | 0.0762 | 8 | 1.7872 |
316
- | 0.0857 | 9 | 1.6329 |
317
- | 0.0952 | 10 | 1.5184 |
318
- | 0.1048 | 11 | 1.234 |
319
- | 0.1143 | 12 | 1.0315 |
320
- | 0.1238 | 13 | 0.9664 |
321
- | 0.1333 | 14 | 0.9369 |
322
- | 0.1429 | 15 | 0.6871 |
323
- | 0.1524 | 16 | 0.5633 |
324
- | 0.1619 | 17 | 0.5141 |
325
- | 0.1714 | 18 | 0.5259 |
326
- | 0.1810 | 19 | 0.4295 |
327
- | 0.1905 | 20 | 0.4585 |
328
- | 0.2 | 21 | 0.2799 |
329
- | 0.2095 | 22 | 0.4226 |
330
- | 0.2190 | 23 | 0.2524 |
331
- | 0.2286 | 24 | 0.2135 |
332
- | 0.2381 | 25 | 0.1958 |
333
- | 0.2476 | 26 | 0.1823 |
334
- | 0.2571 | 27 | 0.393 |
335
- | 0.2667 | 28 | 0.3186 |
336
- | 0.2762 | 29 | 0.1414 |
337
- | 0.2857 | 30 | 0.1927 |
338
- | 0.2952 | 31 | 0.2597 |
339
- | 0.3048 | 32 | 0.1291 |
340
- | 0.3143 | 33 | 0.1488 |
341
- | 0.3238 | 34 | 0.1203 |
342
- | 0.3333 | 35 | 0.2001 |
343
- | 0.3429 | 36 | 0.1877 |
344
- | 0.3524 | 37 | 0.0713 |
345
- | 0.3619 | 38 | 0.1778 |
346
- | 0.3714 | 39 | 0.1179 |
347
- | 0.3810 | 40 | 0.147 |
348
- | 0.3905 | 41 | 0.1158 |
349
- | 0.4 | 42 | 0.1003 |
350
- | 0.4095 | 43 | 0.158 |
351
- | 0.4190 | 44 | 0.159 |
352
- | 0.4286 | 45 | 0.063 |
353
- | 0.4381 | 46 | 0.1309 |
354
- | 0.4476 | 47 | 0.0327 |
355
- | 0.4571 | 48 | 0.1665 |
356
- | 0.4667 | 49 | 0.1064 |
357
- | 0.4762 | 50 | 0.0699 |
358
- | 0.4857 | 51 | 0.0674 |
359
- | 0.4952 | 52 | 0.0508 |
360
- | 0.5048 | 53 | 0.0493 |
361
- | 0.5143 | 54 | 0.0565 |
362
- | 0.5238 | 55 | 0.0366 |
363
- | 0.5333 | 56 | 0.0606 |
364
- | 0.5429 | 57 | 0.0727 |
365
- | 0.5524 | 58 | 0.092 |
366
- | 0.5619 | 59 | 0.0628 |
367
- | 0.5714 | 60 | 0.0369 |
368
- | 0.5810 | 61 | 0.0889 |
369
- | 0.5905 | 62 | 0.0409 |
370
- | 0.6 | 63 | 0.0545 |
371
- | 0.6095 | 64 | 0.0856 |
372
- | 0.6190 | 65 | 0.0478 |
373
- | 0.6286 | 66 | 0.0584 |
374
- | 0.6381 | 67 | 0.0757 |
375
- | 0.6476 | 68 | 0.0609 |
376
- | 0.6571 | 69 | 0.0381 |
377
- | 0.6667 | 70 | 0.069 |
378
- | 0.6762 | 71 | 0.0243 |
379
- | 0.6857 | 72 | 0.0517 |
380
- | 0.6952 | 73 | 0.0332 |
381
- | 0.7048 | 74 | 0.0662 |
382
- | 0.7143 | 75 | 0.0753 |
383
- | 0.7238 | 76 | 0.0914 |
384
- | 0.7333 | 77 | 0.1094 |
385
- | 0.7429 | 78 | 0.0557 |
386
- | 0.7524 | 79 | 0.0436 |
387
- | 0.7619 | 80 | 0.0137 |
388
- | 0.7714 | 81 | 0.0399 |
389
- | 0.7810 | 82 | 0.0278 |
390
- | 0.7905 | 83 | 0.0438 |
391
- | 0.8 | 84 | 0.1392 |
392
- | 0.8095 | 85 | 0.0299 |
393
- | 0.8190 | 86 | 0.0667 |
394
- | 0.8286 | 87 | 0.0404 |
395
- | 0.8381 | 88 | 0.0166 |
396
- | 0.8476 | 89 | 0.1679 |
397
- | 0.8571 | 90 | 0.0282 |
398
- | 0.8667 | 91 | 0.0628 |
399
- | 0.8762 | 92 | 0.0618 |
400
- | 0.8857 | 93 | 0.0167 |
401
- | 0.8952 | 94 | 0.2108 |
402
- | 0.9048 | 95 | 0.0749 |
403
- | 0.9143 | 96 | 0.0997 |
404
- | 0.9238 | 97 | 0.0675 |
405
- | 0.9333 | 98 | 0.0409 |
406
- | 0.9429 | 99 | 0.0355 |
407
- | 0.9524 | 100 | 0.1391 |
408
- | 0.9619 | 101 | 0.0938 |
409
- | 0.9714 | 102 | 0.0526 |
410
- | 0.9810 | 103 | 0.0035 |
411
- | 0.9905 | 104 | 0.0022 |
412
- | 1.0 | 105 | 0.0016 |
413
 
414
- </details>
415
 
416
  ### Framework Versions
417
  - Python: 3.9.19
 
7
  - sentence-similarity
8
  - feature-extraction
9
  - generated_from_trainer
10
+ - dataset_size:6574
11
  - loss:MultipleNegativesRankingLoss
12
  widget:
13
+ - source_sentence: sigma N protein interactions
14
  sentences:
15
+ - 'Smoking Relapse After Lung Transplantation: Is a Second Transplant Justified? '
16
+ - 'Core RNA polymerase and promoter DNA interactions of purified domains of sigma
17
+ N: bipartite functions. '
18
+ - 'Protein-protein interactions mapped by artificial proteases: where sigma factors
19
+ bind to RNA polymerase. '
20
+ - source_sentence: Frailty pathway co-design
21
  sentences:
22
+ - 'High-Sensitivity Cardiac Troponin I Levels in Normal and Hypertensive Pregnancy. '
23
+ - 'The systematic approach to improving care for Frail Older Patients (SAFE) study:
24
+ A protocol for co-designing a frail older person''s pathway. '
25
+ - 'Frailty: successful clinical practice implementation. '
26
+ - source_sentence: Diurnal lipid metabolism in lactating sheep
27
  sentences:
28
+ - 'Interpreting and applying the EUFEST results using number needed to treat: antipsychotic
29
+ effectiveness in first-episode schizophrenia. '
30
+ - 'Diurnal variations in the concentration, arteriovenous difference, extraction
31
+ ratio, and uptake of 3-hydroxybutyrate and plasma free fatty acids in the hind
32
+ limb of lactating sheep. '
33
+ - 'Diurnal regulation of milk lipid production and milk secretion in the rat: effect
34
+ of dietary protein and energy restriction. '
35
+ - source_sentence: Ectopic gastric mucosa
36
  sentences:
37
+ - '[Ectopic cardia and gastroesophageal reflux]. '
38
+ - 'A bacterial toxicity assay performed with microplates, microluminometry and Microtox
39
+ reagent. '
40
+ - 'Gastric polyp. '
41
+ - source_sentence: monograph editing
42
  sentences:
43
+ - 'Monographs editor. '
44
+ - 'Maternal stress and high-fat diet effect on maternal behavior, milk composition,
45
+ and pup ingestive behavior. '
46
+ - 'The editing life. '
 
 
47
  ---
48
 
49
  # SentenceTransformer based on allenai/specter2_base
 
96
  model = SentenceTransformer("sentence_transformers_model_id")
97
  # Run inference
98
  sentences = [
99
+ 'monograph editing',
100
+ 'Monographs editor. ',
101
+ 'The editing life. ',
102
  ]
103
  embeddings = model.encode(sentences)
104
  print(embeddings.shape)
 
153
  #### json
154
 
155
  * Dataset: json
156
+ * Size: 6,574 training samples
157
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
158
  * Approximate statistics based on the first 1000 samples:
159
  | | anchor | positive | negative |
160
  |:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
161
  | type | string | string | string |
162
+ | details | <ul><li>min: 3 tokens</li><li>mean: 7.59 tokens</li><li>max: 33 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 19.89 tokens</li><li>max: 70 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 11.97 tokens</li><li>max: 50 tokens</li></ul> |
163
  * Samples:
164
+ | anchor | positive | negative |
165
+ |:-------------------------------------------------|:--------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------|
166
+ | <code>α-Alumina Nanoparticle Grafting</code> | <code>Grafting PMMA Brushes from α-Alumina Nanoparticles via SI-ATRP. </code> | <code>Mesoporous alumina from colloidal biotemplating of Al clusters. </code> |
167
+ | <code>Congenital candidiasis septic shock</code> | <code>Congenital candidiasis presenting as septic shock without rash. </code> | <code>Congenital cutaneous candidiasis: clinical presentation, pathogenesis, and management guidelines. </code> |
168
+ | <code>Chronic Venous Occlusion</code> | <code>Anatomic response of canine hindlimb vasculature to chronic venous occlusion. </code> | <code>Chronic venous insufficiency. </code> |
169
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
170
  ```json
171
  {
 
305
  </details>
306
 
307
  ### Training Logs
 
 
308
  | Epoch | Step | Training Loss |
309
  |:------:|:----:|:-------------:|
310
+ | 0.0145 | 1 | 2.8777 |
311
+ | 0.0290 | 2 | 2.8723 |
312
+ | 0.0435 | 3 | 2.7432 |
313
+ | 0.0580 | 4 | 2.8806 |
314
+ | 0.0725 | 5 | 2.3007 |
315
+ | 0.0870 | 6 | 2.2423 |
316
+ | 0.1014 | 7 | 1.995 |
317
+ | 0.1159 | 8 | 1.5115 |
318
+ | 0.1304 | 9 | 1.41 |
319
+ | 0.1449 | 10 | 1.243 |
320
+ | 0.1594 | 11 | 1.1634 |
321
+ | 0.1739 | 12 | 1.1996 |
322
+ | 0.1884 | 13 | 1.3653 |
323
+ | 0.2029 | 14 | 1.5704 |
324
+ | 0.2174 | 15 | 1.3556 |
325
+ | 0.2319 | 16 | 1.4051 |
326
+ | 0.2464 | 17 | 1.0999 |
327
+ | 0.2609 | 18 | 1.0826 |
328
+ | 0.2754 | 19 | 1.0449 |
329
+ | 0.2899 | 20 | 1.0517 |
330
+ | 0.3043 | 21 | 0.9716 |
331
+ | 0.3188 | 22 | 1.1993 |
332
+ | 0.3333 | 23 | 1.1375 |
333
+ | 0.3478 | 24 | 0.9875 |
334
+ | 0.3623 | 25 | 0.7656 |
335
+ | 0.3768 | 26 | 1.2773 |
336
+ | 0.3913 | 27 | 0.7802 |
337
+ | 0.4058 | 28 | 0.882 |
338
+ | 0.4203 | 29 | 1.0534 |
339
+ | 0.4348 | 30 | 0.9073 |
340
+ | 0.4493 | 31 | 0.916 |
341
+ | 0.4638 | 32 | 0.9702 |
342
+ | 0.4783 | 33 | 1.2868 |
343
+ | 0.4928 | 34 | 1.0854 |
344
+ | 0.5072 | 35 | 0.8832 |
345
+ | 0.5217 | 36 | 0.9139 |
346
+ | 0.5362 | 37 | 0.9032 |
347
+ | 0.5507 | 38 | 0.965 |
348
+ | 0.5652 | 39 | 0.7222 |
349
+ | 0.5797 | 40 | 0.6682 |
350
+ | 0.5942 | 41 | 0.8562 |
351
+ | 0.6087 | 42 | 0.9248 |
352
+ | 0.6232 | 43 | 0.9867 |
353
+ | 0.6377 | 44 | 0.7328 |
354
+ | 0.6522 | 45 | 0.7506 |
355
+ | 0.6667 | 46 | 0.7952 |
356
+ | 0.6812 | 47 | 0.7979 |
357
+ | 0.6957 | 48 | 1.0043 |
358
+ | 0.7101 | 49 | 1.0428 |
359
+ | 0.7246 | 50 | 0.8772 |
360
+ | 0.7391 | 51 | 0.6598 |
361
+ | 0.7536 | 52 | 0.7804 |
362
+ | 0.7681 | 53 | 0.599 |
363
+ | 0.7826 | 54 | 0.7974 |
364
+ | 0.7971 | 55 | 0.7489 |
365
+ | 0.8116 | 56 | 0.8701 |
366
+ | 0.8261 | 57 | 0.8903 |
367
+ | 0.8406 | 58 | 0.7223 |
368
+ | 0.8551 | 59 | 0.925 |
369
+ | 0.8696 | 60 | 1.0247 |
370
+ | 0.8841 | 61 | 0.7531 |
371
+ | 0.8986 | 62 | 0.9684 |
372
+ | 0.9130 | 63 | 0.7462 |
373
+ | 0.9275 | 64 | 0.8555 |
374
+ | 0.9420 | 65 | 0.8016 |
375
+ | 0.9565 | 66 | 0.7603 |
376
+ | 0.9710 | 67 | 1.1052 |
377
+ | 0.9855 | 68 | 0.9505 |
378
+ | 1.0 | 69 | 0.6259 |
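
Review note: the Epoch column in both versions of the training log appears to be the step index divided by the number of steps in the epoch (105 before this commit, 69 after). The sketch below reproduces a few logged values. The effective batch size of 96 is an inference from the dataset sizes and step counts, not something these hunks state, and gradient accumulation or multiple devices would change how that number is split. The function and constant names are illustrative only.

```python
import math


def steps_per_epoch(num_samples: int, effective_batch_size: int) -> int:
    """Optimizer steps in one epoch when the last partial batch is kept."""
    return math.ceil(num_samples / effective_batch_size)


def epoch_fraction(step: int, total_steps: int) -> float:
    """Fraction of the epoch completed after `step` steps, to 4 decimals."""
    return round(step / total_steps, 4)


# Assumption: inferred from the logs, not shown anywhere in the diff.
ASSUMED_BATCH_SIZE = 96

old_steps = steps_per_epoch(9988, ASSUMED_BATCH_SIZE)  # dataset_size before
new_steps = steps_per_epoch(6574, ASSUMED_BATCH_SIZE)  # dataset_size after

print(old_steps, new_steps)
print(epoch_fraction(1, new_steps), epoch_fraction(35, new_steps))
```

If the inferred batch size is right, the shorter log follows directly from the smaller dataset; the much higher final loss (0.6259 versus 0.0016) is a separate change that the step arithmetic does not explain.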
379
 
 
380
 
381
  ### Framework Versions
382
  - Python: 3.9.19
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bf41cf85b1182745d1af9be640cb3a0cfae661bbd771632fc9c7a2eaddb57599
3
  size 439696224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8bde86f785555d47618677bc7c74848231a3556a1eb547e6ded8a24d9917051b
3
  size 439696224
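
Review note: the README in this commit tags the model with `loss:MultipleNegativesRankingLoss` and lists `anchor`, `positive`, and `negative` columns, but the loss parameters fall outside the hunks shown. As a rough illustration of what that loss computes, here is a dependency-free sketch. It assumes cosine similarity and a scale of 20, which I believe are the sentence-transformers defaults but which this diff does not confirm. The helper names and toy vectors are hypothetical, and this is not the library's implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mnr_loss(anchors, positives, negatives, scale=20.0):
    """Mean cross-entropy where anchor i must pick positive i out of
    every positive and every negative in the batch."""
    candidates = positives + negatives  # in-batch candidates
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [scale * cosine(anchor, c) for c in candidates]
        peak = max(logits)  # subtract the max for numerical stability
        log_sum_exp = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_sum_exp - logits[i]  # -log softmax at the true index
    return total / len(anchors)


# Toy embeddings (made up): each positive matches its anchor exactly,
# each negative points the opposite way.
toy_anchors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
toy_positives = [row[:] for row in toy_anchors]
toy_negatives = [[-x for x in row] for row in toy_anchors]

well_separated = mnr_loss(toy_anchors, toy_positives, toy_negatives)
swapped = mnr_loss(toy_anchors, toy_negatives, toy_positives)
print(well_separated, swapped)
```

Because every other row's positive and negative also acts as a candidate, the loss depends on batch composition, which is one reason the two training logs in this diff are not directly comparable row by row.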