Spaces:

sentence-transformers
/

backend-export

Running on T4

Tom Aarsen commited on Nov 12, 2024

Commit

810f572

1 Parent(s): ebb05ef

Wrap details/summary in HTML instead

Files changed (1) hide show

app.py CHANGED Viewed

@@ -645,7 +645,11 @@ with gr.Blocks(
 Sentence Transformers embedding models can be optimized for **faster inference** on CPU and GPU devices by exporting, quantizing, and optimizing them in ONNX and OpenVINO formats.
 Observe the [Speeding up Inference](https://sbert.net/docs/sentence_transformer/usage/efficiency.html) documentation for more information.
 <details><summary>Click to see performance benchmarks</summary>
 <table>
@@ -667,18 +671,16 @@ Observe the [Speeding up Inference](https://sbert.net/docs/sentence_transformer/
   </tbody>
 </table>
-* `onnx` refers to the ONNX backend
-* `onnx-qint8` refers to ONNX (Dynamic Quantization)
-* `onnx-O1` to `onnx-O4` refers to ONNX (Optimization)
-* `openvino` refers to the OpenVINO backend
-* `openvino-qint8` refers to OpenVINO (Static Quantization)
 </details>
-""",
-                label="",
-                container=True,
-            )
             model_id = HuggingfaceHubSearch(
                 label="Hub Model ID",

 Sentence Transformers embedding models can be optimized for **faster inference** on CPU and GPU devices by exporting, quantizing, and optimizing them in ONNX and OpenVINO formats.
 Observe the [Speeding up Inference](https://sbert.net/docs/sentence_transformer/usage/efficiency.html) documentation for more information.
+""",
+                label="",
+                container=True,
+            )
+            gr.HTML(value="""\
 <details><summary>Click to see performance benchmarks</summary>
 <table>
   </tbody>
 </table>
+<ul>
+  <li><code>onnx</code> refers to the ONNX backend</li>
+  <li><code>onnx-qint8</code> refers to ONNX (Dynamic Quantization)</li>
+  <li><code>onnx-O1</code> to <code>onnx-O4</code> refers to ONNX (Optimization)</li>
+  <li><code>openvino</code> refers to the OpenVINO backend</li>
+  <li><code>openvino-qint8</code> refers to OpenVINO (Static Quantization)</li>
+</ul>
 </details>
+""")
             model_id = HuggingfaceHubSearch(
                 label="Hub Model ID",