liang.zhao committed on
Commit 2798948
1 Parent(s): 1177f07

update model and config

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -36,7 +36,7 @@ As of September 2024, Skywork-Critic-Llama3.1-8B ranks first on RewardBench for
 
 | Model | Chat | Chat Hard | Safety | Reasoning | Overall Score |
 | ------------------------------- | :---: | :-------: | :----: | :-------: | :---: |
-| Skywork-Critic-Llama3.1-8B * | **93.6** | **81.4** | **91.1** | **89.8** | **89.0** |
+| **Skywork-Critic-Llama3.1-8B** * | **93.6** | **81.4** | **91.1** | **89.8** | **89.0** |
 | Salesforce/SFR-LLaMa-3.1-8B-Judge-r | 95.5 | 77.7 | 86.2 | 95.1 | 88.7 |
 | facebook/Self-taught-Llama-3-70B | 96.9 | 84.0 | 91.1 | 82.5 | 88.6 |
 | google/gemini-1.5-pro-0514 | 92.3 | 80.6 | 87.9 | 92.0 | 88.2 |