Spaces:

arcee-ai
/

Benchmarks

Running

Julien Simon commited on Sep 9, 2024

Commit

37a6a80

•

1 Parent(s): e009fe7

Add inf2.xlarge benchmark

Files changed (1) hide show

results_llama_spark.py CHANGED Viewed

@@ -105,5 +105,12 @@ results_llama_spark = {
             "tokensPerSecond": "-",
             "notes": "Llama-3.1: TGI OK, Neuron SDK OK, optimum-neuron KO",
         },
     ],
 }

             "tokensPerSecond": "-",
             "notes": "Llama-3.1: TGI OK, Neuron SDK OK, optimum-neuron KO",
         },
+        {
+            "instanceType": "inf2.2xlarge",
+            "container": "transformers-neuronx 0.11.351",
+            "status": "OK",
+            "tokensPerSecond": "24",
+            "notes": "Neuron SDK 2.19.1",
+        },
     ],
 }