Update README.md
README.md
CHANGED
@@ -88,10 +88,17 @@ but some how it did not build as some parameters was not loading. ie it would no
 the model was trained to perfection so it still works fine!
 the lora was made so that later it can be loaded with the model for further training of the affected tensors...
 
-
-### Merge Method
-
-This
+This Expert is a companion to the MEGA_MIND 24b CyberSeries, which represents a groundbreaking leap in the realm of language models, integrating a diverse array of expert models into a unified framework. At its core lies the Mistral-7B-Instruct-v0.2, a refined instructional model designed for versatility and efficiency.
+
+Enhanced with an expanded context window and advanced routing mechanisms, the Mistral-7B-Instruct-v0.2 exemplifies the power of Mixture of Experts, allowing seamless integration of specialized sub-models. This architecture facilitates strong performance and scalability, enabling the CyberSeries to tackle a myriad of tasks with exceptional speed and accuracy.
+
+Among its illustrious sub-models, the OpenOrca - Mistral-7B-8k shines as a testament to fine-tuning excellence, boasting top-ranking performance in its class. Meanwhile, the Hermes 2 Pro introduces cutting-edge capabilities such as Function Calling and JSON Mode, catering to diverse application needs.
+
+Driven by Reinforcement Learning from AI Feedback, the Starling-LM-7B-beta demonstrates remarkable adaptability and optimization, while the Phi-1.5 Transformer model stands as a beacon of excellence across various domains, from common-sense reasoning to medical inference.
+
+With models like BioMistral tailored specifically for medical applications and Nous-Yarn-Mistral-7b-128k excelling in handling long-context data, the MEGA_MIND 24b CyberSeries emerges as a transformative force in the landscape of language understanding and artificial intelligence.
+
+Experience the future of language models with the MEGA_MIND 24b CyberSeries, where innovation meets performance, and possibilities are limitless.
 
 ### Models Merged
 