Update README.md
Browse files
README.md
CHANGED
@@ -13,16 +13,16 @@ tags:
|
|
13 |
|
14 |
FINAL BENCHMARKING
|
15 |
------------------------------
|
16 |
-
Time to First Token (TTFT)
|
17 |
-
Time Per Output Token (TPOT)
|
18 |
-
Throughput (token/s)
|
19 |
-
Average Token Latency (ms/token)
|
20 |
-
Total Generation Time
|
21 |
-
Input Tokenization Time
|
22 |
-
Input Tokens
|
23 |
-
Output Tokens
|
24 |
-
Total Tokens
|
25 |
-
Memory Usage (GPU)
|
26 |
|
27 |
# Uploaded model
|
28 |
|
|
|
13 |
|
14 |
FINAL BENCHMARKING
|
15 |
------------------------------
|
16 |
+
- **Time to First Token (TTFT)**: 0.001s
|
17 |
+
- **Time Per Output Token (TPOT):** 41.83ms/token
|
18 |
+
- **Throughput (token/s):** 24.35token/s
|
19 |
+
- **Average Token Latency (ms/token):** 41.92ms/token
|
20 |
+
- **Total Generation Time:** 18.427s
|
21 |
+
- **Input Tokenization Time:** 0.009s
|
22 |
+
- **Input Tokens:** 1909
|
23 |
+
- **Output Tokens:** 443
|
24 |
+
- **Total Tokens:** 2352
|
25 |
+
- **Memory Usage (GPU):** 3.38GB
|
26 |
|
27 |
# Uploaded model
|
28 |
|