kperkins411
/

msmarco-distilbert-base-v2_triplet_legal

@@ -1,4 +1,5 @@
 ---
 datasets: []
 language: []
 library_name: sentence-transformers
@@ -187,7 +188,7 @@ widget:
     and/or any of its affiliates and the directors, officers and employees of Domini
     and/or any of its affiliates.
 model-index:
-- name: SentenceTransformer
   results:
   - task:
       type: information-retrieval
@@ -200,103 +201,103 @@ model-index:
       value: 0.3953048087845513
       name: Cosine Accuracy@1
     - type: cosine_accuracy@3
-      value: 0.5342673229837183
       name: Cosine Accuracy@3
     - type: cosine_accuracy@5
-      value: 0.5914426353653919
       name: Cosine Accuracy@5
     - type: cosine_accuracy@10
-      value: 0.66565694812571
       name: Cosine Accuracy@10
     - type: cosine_precision@1
       value: 0.3953048087845513
       name: Cosine Precision@1
     - type: cosine_precision@3
-      value: 0.17808910766123942
       name: Cosine Precision@3
     - type: cosine_precision@5
-      value: 0.11828852707307837
       name: Cosine Precision@5
     - type: cosine_precision@10
-      value: 0.06656569481257099
       name: Cosine Precision@10
     - type: cosine_recall@1
       value: 0.3953048087845513
       name: Cosine Recall@1
     - type: cosine_recall@3
-      value: 0.5342673229837183
       name: Cosine Recall@3
     - type: cosine_recall@5
-      value: 0.5914426353653919
       name: Cosine Recall@5
     - type: cosine_recall@10
-      value: 0.66565694812571
       name: Cosine Recall@10
     - type: cosine_ndcg@10
-      value: 0.5240873176000084
       name: Cosine Ndcg@10
     - type: cosine_mrr@10
-      value: 0.4794995582481382
       name: Cosine Mrr@10
     - type: cosine_map@100
-      value: 0.4872380542829767
       name: Cosine Map@100
     - type: dot_accuracy@1
-      value: 0.3934115865202575
       name: Dot Accuracy@1
     - type: dot_accuracy@3
-      value: 0.5312381673608482
       name: Dot Accuracy@3
     - type: dot_accuracy@5
-      value: 0.5899280575539568
       name: Dot Accuracy@5
     - type: dot_accuracy@10
-      value: 0.6648996592199924
       name: Dot Accuracy@10
     - type: dot_precision@1
-      value: 0.3934115865202575
       name: Dot Precision@1
     - type: dot_precision@3
-      value: 0.1770793891202827
       name: Dot Precision@3
     - type: dot_precision@5
-      value: 0.11798561151079137
       name: Dot Precision@5
     - type: dot_precision@10
-      value: 0.06648996592199924
       name: Dot Precision@10
     - type: dot_recall@1
-      value: 0.3934115865202575
       name: Dot Recall@1
     - type: dot_recall@3
-      value: 0.5312381673608482
       name: Dot Recall@3
     - type: dot_recall@5
-      value: 0.5899280575539568
       name: Dot Recall@5
     - type: dot_recall@10
-      value: 0.6648996592199924
       name: Dot Recall@10
     - type: dot_ndcg@10
-      value: 0.5224316548033627
       name: Dot Ndcg@10
     - type: dot_mrr@10
-      value: 0.4775905591316421
       name: Dot Mrr@10
     - type: dot_map@100
-      value: 0.485319730256097
       name: Dot Map@100
 ---
-# SentenceTransformer
-This is a [sentence-transformers](https://www.SBERT.net) model trained. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
 ## Model Details
 ### Model Description
 - **Model Type:** Sentence Transformer
-<!-- - **Base model:** [Unknown](https://huggingface.co/unknown) -->
 - **Maximum Sequence Length:** 350 tokens
 - **Output Dimensionality:** 768 tokens
 - **Similarity Function:** Cosine Similarity
@@ -383,38 +384,38 @@ You can finetune this model on your own dataset.
 * Dataset: `msmarco-distilbert-base-v2`
 * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
-| Metric              | Value      |
-|:--------------------|:-----------|
-| cosine_accuracy@1   | 0.3953     |
-| cosine_accuracy@3   | 0.5343     |
-| cosine_accuracy@5   | 0.5914     |
-| cosine_accuracy@10  | 0.6657     |
-| cosine_precision@1  | 0.3953     |
-| cosine_precision@3  | 0.1781     |
-| cosine_precision@5  | 0.1183     |
-| cosine_precision@10 | 0.0666     |
-| cosine_recall@1     | 0.3953     |
-| cosine_recall@3     | 0.5343     |
-| cosine_recall@5     | 0.5914     |
-| cosine_recall@10    | 0.6657     |
-| cosine_ndcg@10      | 0.5241     |
-| cosine_mrr@10       | 0.4795     |
-| **cosine_map@100**  | **0.4872** |
-| dot_accuracy@1      | 0.3934     |
-| dot_accuracy@3      | 0.5312     |
-| dot_accuracy@5      | 0.5899     |
-| dot_accuracy@10     | 0.6649     |
-| dot_precision@1     | 0.3934     |
-| dot_precision@3     | 0.1771     |
-| dot_precision@5     | 0.118      |
-| dot_precision@10    | 0.0665     |
-| dot_recall@1        | 0.3934     |
-| dot_recall@3        | 0.5312     |
-| dot_recall@5        | 0.5899     |
-| dot_recall@10       | 0.6649     |
-| dot_ndcg@10         | 0.5224     |
-| dot_mrr@10          | 0.4776     |
-| dot_map@100         | 0.4853     |
 <!--
 ## Bias, Risks and Limitations
@@ -489,6 +490,7 @@ You can finetune this model on your own dataset.
 - `per_device_train_batch_size`: 128
 - `per_device_eval_batch_size`: 128
 - `learning_rate`: 2e-05
 - `warmup_ratio`: 0.1
 - `fp16`: True
 - `load_best_model_at_end`: True
@@ -513,7 +515,7 @@ You can finetune this model on your own dataset.
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
-- `num_train_epochs`: 3
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
@@ -611,30 +613,54 @@ You can finetune this model on your own dataset.
 ### Training Logs
 | Epoch      | Step     | Training Loss | loss       | msmarco-distilbert-base-v2_cosine_map@100 |
 |:----------:|:--------:|:-------------:|:----------:|:-----------------------------------------:|
-| 0          | 0        | -             | -          | 0.4899                                    |
-| 0.1453     | 100      | 0.0787        | -          | -                                         |
-| 0.2907     | 200      | 0.0503        | -          | -                                         |
-| 0.4360     | 300      | 0.0529        | -          | -                                         |
-| 0.5814     | 400      | 0.0636        | -          | -                                         |
-| 0.7267     | 500      | 0.0783        | -          | -                                         |
-| 0.8721     | 600      | 0.0765        | -          | -                                         |
-| 1.0131     | 697      | -             | 0.2284     | -                                         |
-| 1.0044     | 700      | 0.0776        | -          | -                                         |
-| 1.1497     | 800      | 0.0624        | -          | -                                         |
-| 1.2951     | 900      | 0.0289        | -          | -                                         |
-| 1.4404     | 1000     | 0.0244        | -          | -                                         |
-| 1.5858     | 1100     | 0.0256        | -          | -                                         |
-| 1.7311     | 1200     | 0.0364        | -          | -                                         |
-| 1.8765     | 1300     | 0.0334        | -          | -                                         |
-| 2.0131     | 1394     | -             | 0.2175     | -                                         |
-| 2.0087     | 1400     | 0.0342        | -          | -                                         |
-| 2.1541     | 1500     | 0.0274        | -          | -                                         |
-| 2.2994     | 1600     | 0.0153        | -          | -                                         |
-| 2.4448     | 1700     | 0.0167        | -          | -                                         |
-| 2.5901     | 1800     | 0.0178        | -          | -                                         |
-| 2.7355     | 1900     | 0.0221        | -          | -                                         |
-| 2.8808     | 2000     | 0.0227        | -          | -                                         |
-| **2.9738** | **2064** | **-**         | **0.1821** | **0.4872**                                |
 * The bold row denotes the saved checkpoint.

 ---
+base_model: sentence-transformers/msmarco-distilbert-base-v2
 datasets: []
 language: []
 library_name: sentence-transformers
     and/or any of its affiliates and the directors, officers and employees of Domini
     and/or any of its affiliates.
 model-index:
+- name: SentenceTransformer based on sentence-transformers/msmarco-distilbert-base-v2
   results:
   - task:
       type: information-retrieval
       value: 0.3953048087845513
       name: Cosine Accuracy@1
     - type: cosine_accuracy@3
+      value: 0.5376751230594472
       name: Cosine Accuracy@3
     - type: cosine_accuracy@5
+      value: 0.594471790988262
       name: Cosine Accuracy@5
     - type: cosine_accuracy@10
+      value: 0.673608481635744
       name: Cosine Accuracy@10
     - type: cosine_precision@1
       value: 0.3953048087845513
       name: Cosine Precision@1
     - type: cosine_precision@3
+      value: 0.1792250410198157
       name: Cosine Precision@3
     - type: cosine_precision@5
+      value: 0.1188943581976524
       name: Cosine Precision@5
     - type: cosine_precision@10
+      value: 0.06736084816357439
       name: Cosine Precision@10
     - type: cosine_recall@1
       value: 0.3953048087845513
       name: Cosine Recall@1
     - type: cosine_recall@3
+      value: 0.5376751230594472
       name: Cosine Recall@3
     - type: cosine_recall@5
+      value: 0.594471790988262
       name: Cosine Recall@5
     - type: cosine_recall@10
+      value: 0.673608481635744
       name: Cosine Recall@10
     - type: cosine_ndcg@10
+      value: 0.5276829229789854
       name: Cosine Ndcg@10
     - type: cosine_mrr@10
+      value: 0.4818510605049796
       name: Cosine Mrr@10
     - type: cosine_map@100
+      value: 0.48897515764559735
       name: Cosine Map@100
     - type: dot_accuracy@1
+      value: 0.3964407421431276
       name: Dot Accuracy@1
     - type: dot_accuracy@3
+      value: 0.5335100340780008
       name: Dot Accuracy@3
     - type: dot_accuracy@5
+      value: 0.5933358576296858
       name: Dot Accuracy@5
     - type: dot_accuracy@10
+      value: 0.6743657705414615
       name: Dot Accuracy@10
     - type: dot_precision@1
+      value: 0.3964407421431276
       name: Dot Precision@1
     - type: dot_precision@3
+      value: 0.17783667802600023
       name: Dot Precision@3
     - type: dot_precision@5
+      value: 0.11866717152593716
       name: Dot Precision@5
     - type: dot_precision@10
+      value: 0.06743657705414616
       name: Dot Precision@10
     - type: dot_recall@1
+      value: 0.3964407421431276
       name: Dot Recall@1
     - type: dot_recall@3
+      value: 0.5335100340780008
       name: Dot Recall@3
     - type: dot_recall@5
+      value: 0.5933358576296858
       name: Dot Recall@5
     - type: dot_recall@10
+      value: 0.6743657705414615
       name: Dot Recall@10
     - type: dot_ndcg@10
+      value: 0.5274757216450244
       name: Dot Ndcg@10
     - type: dot_mrr@10
+      value: 0.4814724160521211
       name: Dot Mrr@10
     - type: dot_map@100
+      value: 0.4884569183065979
       name: Dot Map@100
 ---
+# SentenceTransformer based on sentence-transformers/msmarco-distilbert-base-v2
+This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/msmarco-distilbert-base-v2](https://huggingface.co/sentence-transformers/msmarco-distilbert-base-v2). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
 ## Model Details
 ### Model Description
 - **Model Type:** Sentence Transformer
+- **Base model:** [sentence-transformers/msmarco-distilbert-base-v2](https://huggingface.co/sentence-transformers/msmarco-distilbert-base-v2) <!-- at revision 741fcf2d6eabaf0927bfe49c6d9c577df95d3c40 -->
 - **Maximum Sequence Length:** 350 tokens
 - **Output Dimensionality:** 768 tokens
 - **Similarity Function:** Cosine Similarity
 * Dataset: `msmarco-distilbert-base-v2`
 * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
+| Metric              | Value     |
+|:--------------------|:----------|
+| cosine_accuracy@1   | 0.3953    |
+| cosine_accuracy@3   | 0.5377    |
+| cosine_accuracy@5   | 0.5945    |
+| cosine_accuracy@10  | 0.6736    |
+| cosine_precision@1  | 0.3953    |
+| cosine_precision@3  | 0.1792    |
+| cosine_precision@5  | 0.1189    |
+| cosine_precision@10 | 0.0674    |
+| cosine_recall@1     | 0.3953    |
+| cosine_recall@3     | 0.5377    |
+| cosine_recall@5     | 0.5945    |
+| cosine_recall@10    | 0.6736    |
+| cosine_ndcg@10      | 0.5277    |
+| cosine_mrr@10       | 0.4819    |
+| **cosine_map@100**  | **0.489** |
+| dot_accuracy@1      | 0.3964    |
+| dot_accuracy@3      | 0.5335    |
+| dot_accuracy@5      | 0.5933    |
+| dot_accuracy@10     | 0.6744    |
+| dot_precision@1     | 0.3964    |
+| dot_precision@3     | 0.1778    |
+| dot_precision@5     | 0.1187    |
+| dot_precision@10    | 0.0674    |
+| dot_recall@1        | 0.3964    |
+| dot_recall@3        | 0.5335    |
+| dot_recall@5        | 0.5933    |
+| dot_recall@10       | 0.6744    |
+| dot_ndcg@10         | 0.5275    |
+| dot_mrr@10          | 0.4815    |
+| dot_map@100         | 0.4885    |
 <!--
 ## Bias, Risks and Limitations
 - `per_device_train_batch_size`: 128
 - `per_device_eval_batch_size`: 128
 - `learning_rate`: 2e-05
+- `num_train_epochs`: 6
 - `warmup_ratio`: 0.1
 - `fp16`: True
 - `load_best_model_at_end`: True
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
+- `num_train_epochs`: 6
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
 ### Training Logs
 | Epoch      | Step     | Training Loss | loss       | msmarco-distilbert-base-v2_cosine_map@100 |
 |:----------:|:--------:|:-------------:|:----------:|:-----------------------------------------:|
+| 0          | 0        | -             | -          | 0.4145                                    |
+| 0.1453     | 100      | 1.7626        | -          | -                                         |
+| 0.2907     | 200      | 0.9595        | -          | -                                         |
+| 0.4360     | 300      | 0.7263        | -          | -                                         |
+| 0.5814     | 400      | 0.6187        | -          | -                                         |
+| 0.7267     | 500      | 0.5571        | -          | -                                         |
+| 0.8721     | 600      | 0.4885        | -          | -                                         |
+| 1.0131     | 697      | -             | 0.3676     | -                                         |
+| 1.0044     | 700      | 0.4283        | -          | -                                         |
+| 1.1497     | 800      | 0.3956        | -          | -                                         |
+| 1.2951     | 900      | 0.2941        | -          | -                                         |
+| 1.4404     | 1000     | 0.2437        | -          | -                                         |
+| 1.5858     | 1100     | 0.1988        | -          | -                                         |
+| 1.7311     | 1200     | 0.185         | -          | -                                         |
+| 1.8765     | 1300     | 0.1571        | -          | -                                         |
+| 2.0131     | 1394     | -             | 0.2679     | -                                         |
+| 2.0087     | 1400     | 0.1409        | -          | -                                         |
+| 2.1541     | 1500     | 0.1368        | -          | -                                         |
+| 2.2994     | 1600     | 0.111         | -          | -                                         |
+| 2.4448     | 1700     | 0.0994        | -          | -                                         |
+| 2.5901     | 1800     | 0.0837        | -          | -                                         |
+| 2.7355     | 1900     | 0.076         | -          | -                                         |
+| 2.8808     | 2000     | 0.0645        | -          | -                                         |
+| 3.0131     | 2091     | -             | 0.2412     | -                                         |
+| 3.0131     | 2100     | 0.0607        | -          | -                                         |
+| 3.1584     | 2200     | 0.0609        | -          | -                                         |
+| 3.3038     | 2300     | 0.0503        | -          | -                                         |
+| 3.4491     | 2400     | 0.0483        | -          | -                                         |
+| 3.5945     | 2500     | 0.0402        | -          | -                                         |
+| 3.7398     | 2600     | 0.0397        | -          | -                                         |
+| 3.8852     | 2700     | 0.0305        | -          | -                                         |
+| 4.0131     | 2788     | -             | 0.2196     | -                                         |
+| 4.0174     | 2800     | 0.0304        | -          | -                                         |
+| 4.1628     | 2900     | 0.0307        | -          | -                                         |
+| 4.3081     | 3000     | 0.0256        | -          | -                                         |
+| 4.4535     | 3100     | 0.0258        | -          | -                                         |
+| 4.5988     | 3200     | 0.0212        | -          | -                                         |
+| 4.7442     | 3300     | 0.0213        | -          | -                                         |
+| 4.8895     | 3400     | 0.0174        | -          | -                                         |
+| 5.0131     | 3485     | -             | 0.2036     | -                                         |
+| 5.0218     | 3500     | 0.0191        | -          | -                                         |
+| 5.1672     | 3600     | 0.0198        | -          | -                                         |
+| 5.3125     | 3700     | 0.0161        | -          | -                                         |
+| 5.4578     | 3800     | 0.0166        | -          | -                                         |
+| 5.6032     | 3900     | 0.0135        | -          | -                                         |
+| 5.7485     | 4000     | 0.0145        | -          | -                                         |
+| 5.8939     | 4100     | 0.0129        | -          | -                                         |
+| **5.9346** | **4128** | **-**         | **0.1966** | **0.489**                                 |
 * The bold row denotes the saved checkpoint.

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1796c6e005742413b753de6f83fdd6c3515b94cb1fce753d6adae3c90fe9191d
 size 265462608

 version https://git-lfs.github.com/spec/v1
+oid sha256:6ff4f47578afdd7445b15b66710dfe43895a5be76181400182d87f9d1700cd4f
 size 265462608