Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -235,6 +235,26 @@ Average: 41.65
 | | |mc2 |0.5911|± |0.0158|
 ```
 # Inference Code
 Here is example code using HuggingFace Transformers to inference the model (note: in 4bit, it will require around 5GB of VRAM)

 | | |mc2 |0.5911|± |0.0158|
 ```
+# Function Calling Evaluations
+We worked with Fireworks.AI on evaluations by starting off with their Function Calling eval dataset, fixing some unsolveable ones, and generating a second eval dataset for JSON mode.
+## Function Calling Accuracy: 91%
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/XF3Zii4-QhE2yjWwHr_v4.png)
+## JSON Mode Accuracy: 84%
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/8H2iyjh5wyP2FtLq2LCed.png)
+Run the evaluator yourself using @interstellarninja's codebase here:
+https://github.com/interstellarninja/function-calling-eval
+You can find the evaluation datasets here:
+https://huggingface.co/datasets/NousResearch/func-calling-eval
+https://huggingface.co/datasets/NousResearch/json-mode-eval
 # Inference Code
 Here is example code using HuggingFace Transformers to inference the model (note: in 4bit, it will require around 5GB of VRAM)