Update README.md
README.md
CHANGED
@@ -12,8 +12,8 @@ tags:
 # Hermes-2.5-Yi-1.5-9B-Chat
 
 This model is a fine-tuned version of [01-ai/Yi-1.5-9B-Chat](https://huggingface.co/01-ai/Yi-1.5-9B-Chat) on the [teknium/OpenHermes-2.5](https://huggingface.co/datasets/teknium/OpenHermes-2.5) dataset.
-I'm very happy with the results. The model now seems a lot smarter and "aware" in certain situations. It got quite an big edge on the AGIEval Benchmark for models in it's class
-I plan to extend
+I'm very happy with the results. The model now seems a lot smarter and "aware" in certain situations. It got quite an big edge on the AGIEval Benchmark for models in it's class.
+I plan to extend its context length to 32k with POSE.
 
 ## Model Details
 
@@ -94,6 +94,7 @@ This model is released under the Apache 2.0 license.
 Special thanks to:
 - Teknium for the great OpenHermes-2.5 dataset
 - 01-ai for their great model
+- KIT SCC for FLOPS
 
 ## Citation
 