Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -39,7 +39,7 @@ accross various devices, can be found [here](https://aihub.qualcomm.com/models/m
|
|
39 |
- Decoding length: 4096
|
40 |
- Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
|
41 |
|
42 |
-
| Model | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds) | Tiny MMLU
|
43 |
|---|---|---|---|---|---|---|
|
44 |
| Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |
|
45 |
|
|
|
39 |
- Decoding length: 4096
|
40 |
- Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
|
41 |
|
42 |
+
| Model | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds) | Tiny MMLU |
|
43 |
|---|---|---|---|---|---|---|
|
44 |
| Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |
|
45 |
|