Elizezen/Omnia-2x7B


Elizezen committed on Apr 23

Commit a0f145c (1 parent: 0f1a3c8)

Update README.md

Files changed (1)

README.md +15 -1


@@ -24,6 +24,18 @@ In extensive testing and benchmarks, Omnia has proven to be an exceptionally str
 ## Benchmark Results
 **Benchmark Metrics:**
 ### Complexity
@@ -49,4 +61,6 @@ Similarity: 0.969
 ```
-*While the benchmark provides some insights, it is important to consider that the specific undisclosed details of the benchmark may introduce biases. Therefore, it is recommended to take this result with a grain of salt for now.*

 ## Benchmark Results
+| Model                                  | average  | complexity | contextual maintenance | similarity to ground truth |
+| -------------------------------------- | -------- | ---------- | ---------------------- | -------------------------- |
+| **Omnia-2x7B**                         | **82.2** | **55.5**   | **95.9**               | **95.3**                   |
+| Omnia-7B                               | 81.8     | 55.0       | 95.7                   | 94.8                       |
+| Antler-RP-ja-westlake-chatvector       | 81.6     | 54.2       | 95.6                   | 95.0                       |
+| LightChatAssistant-TypeB-2x7B          | 81.5     | 55.1       | 95.0                   | 94.5                       |
+| Antler-7B                              | 81.5     | 53.7       | 95.8                   | 95.1                       |
+| chatntq-ja-7b-v1.0-westlake-chatvector | 79.7     | 49.8       | 95.0                   | 94.3                       |
+| chatntq-ja-7b-v1.0                     | 79.7     | 52.0       | 93.9                   | 93.2                       |
 **Benchmark Metrics:**
 ### Complexity
 ```
+*While the benchmark provides some insights, it is important to consider that the specific undisclosed details of the benchmark may introduce biases. Therefore, it is recommended to take this result with a grain of salt for now.*
+If you want to know the full generation results of the benchmark, please contact me at [[email protected]](mailto:[email protected])
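
The README does not say how the `average` column is derived. The numbers in the added table are consistent with an unweighted mean of the three metric columns, rounded to one decimal place. The sketch below checks that reading against the values copied from the table; the helper name `mean_score` and the dictionary layout are illustrative assumptions, not part of the repository.

```python
# Scores copied from the table in this commit, in the order:
# (complexity, contextual maintenance, similarity to ground truth).
scores = {
    "Omnia-2x7B": (55.5, 95.9, 95.3),
    "Omnia-7B": (55.0, 95.7, 94.8),
    "Antler-RP-ja-westlake-chatvector": (54.2, 95.6, 95.0),
    "LightChatAssistant-TypeB-2x7B": (55.1, 95.0, 94.5),
    "Antler-7B": (53.7, 95.8, 95.1),
    "chatntq-ja-7b-v1.0-westlake-chatvector": (49.8, 95.0, 94.3),
    "chatntq-ja-7b-v1.0": (52.0, 93.9, 93.2),
}

# The "average" column as reported in the table.
reported_average = {
    "Omnia-2x7B": 82.2,
    "Omnia-7B": 81.8,
    "Antler-RP-ja-westlake-chatvector": 81.6,
    "LightChatAssistant-TypeB-2x7B": 81.5,
    "Antler-7B": 81.5,
    "chatntq-ja-7b-v1.0-westlake-chatvector": 79.7,
    "chatntq-ja-7b-v1.0": 79.7,
}


def mean_score(values):
    """Unweighted arithmetic mean (an assumed formula, not confirmed by the README)."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for model, values in scores.items():
        computed = mean_score(values)
        print(f"{model}: computed {computed:.1f}, reported {reported_average[model]}")
```

If the benchmark actually weights the metrics differently, the agreement here would be a coincidence of these particular values, so treat the formula as a plausible reading rather than a documented fact.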