qualcomm
/

Mistral-7B-Instruct-v0_3

@@ -39,7 +39,7 @@ accross various devices, can be found [here](https://aihub.qualcomm.com/models/m
   - Decoding length: 4096
   - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
-| Model | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds) | Tiny MMLU
 |---|---|---|---|---|---|---|
 | Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |

   - Decoding length: 4096
   - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
+| Model | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds) | Tiny MMLU |
 |---|---|---|---|---|---|---|
 | Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |