Update README.md
Browse files
README.md
CHANGED
@@ -14,12 +14,12 @@ tags:
|
|
14 |
- llama
|
15 |
|
16 |
---
|
17 |
-
# SmartLlama-3-Ko-8B
|
18 |
<a href="https://ibb.co/C8Tcw1F"><img src="https://i.ibb.co/QQ1gJbG/smartllama3.png" alt="smartllama3" border="0"></a><br />
|
19 |
|
20 |
SmartLlama-3-Ko-8B is a sophisticated AI model that integrates the capabilities of several advanced language models. This merged model is designed to excel in a variety of tasks ranging from technical problem-solving to multilingual communication.
|
21 |
|
22 |
-
## Merge Details
|
23 |
|
24 |
### Component Models and Contributions
|
25 |
|
@@ -50,11 +50,11 @@ SmartLlama-3-Ko-8B is highly capable and versatile, suitable for:
|
|
50 |
|
51 |
This comprehensive capability makes SmartLlama-3-Ko-8B not only a powerful tool for general-purpose AI tasks but also a specialized resource for industries and applications demanding high levels of technical and linguistic precision.
|
52 |
|
53 |
-
## Merge Method
|
54 |
|
55 |
This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [NousResearch/Meta-Llama-3-8B](https://huggingface.co/NousResearch/Meta-Llama-3-8B) as a base.
|
56 |
|
57 |
-
## Models Merged
|
58 |
|
59 |
The following models were included in the merge:
|
60 |
* [beomi/Llama-3-Open-Ko-8B-Instruct-preview](https://huggingface.co/beomi/Llama-3-Open-Ko-8B-Instruct-preview)
|
@@ -63,7 +63,7 @@ The following models were included in the merge:
|
|
63 |
* [abacusai/Llama-3-Smaug-8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B)
|
64 |
* [Locutusque/Llama-3-Orca-1.0-8B](https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B)
|
65 |
|
66 |
-
## Configuration
|
67 |
|
68 |
The following YAML configuration was used to produce this model:
|
69 |
|
@@ -100,7 +100,7 @@ dtype: bfloat16
|
|
100 |
|
101 |
```
|
102 |
|
103 |
-
### Test Result
|
104 |
|
105 |
**Korean Multi Turn Conversation**
|
106 |
<a href="https://ibb.co/v40tkNj"><img src="https://i.ibb.co/hF3qVGm/Screenshot-2024-04-30-at-8-26-57-AM.png" alt="Screenshot-2024-04-30-at-8-26-57-AM" border="0"></a>
|
|
|
14 |
- llama
|
15 |
|
16 |
---
|
17 |
+
# π°π· SmartLlama-3-Ko-8B
|
18 |
<a href="https://ibb.co/C8Tcw1F"><img src="https://i.ibb.co/QQ1gJbG/smartllama3.png" alt="smartllama3" border="0"></a><br />
|
19 |
|
20 |
SmartLlama-3-Ko-8B is a sophisticated AI model that integrates the capabilities of several advanced language models. This merged model is designed to excel in a variety of tasks ranging from technical problem-solving to multilingual communication.
|
21 |
|
22 |
+
## π Merge Details
|
23 |
|
24 |
### Component Models and Contributions
|
25 |
|
|
|
50 |
|
51 |
This comprehensive capability makes SmartLlama-3-Ko-8B not only a powerful tool for general-purpose AI tasks but also a specialized resource for industries and applications demanding high levels of technical and linguistic precision.
|
52 |
|
53 |
+
## ποΈ Merge Method
|
54 |
|
55 |
This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [NousResearch/Meta-Llama-3-8B](https://huggingface.co/NousResearch/Meta-Llama-3-8B) as a base.
|
56 |
|
57 |
+
## π Models Merged
|
58 |
|
59 |
The following models were included in the merge:
|
60 |
* [beomi/Llama-3-Open-Ko-8B-Instruct-preview](https://huggingface.co/beomi/Llama-3-Open-Ko-8B-Instruct-preview)
|
|
|
63 |
* [abacusai/Llama-3-Smaug-8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B)
|
64 |
* [Locutusque/Llama-3-Orca-1.0-8B](https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B)
|
65 |
|
66 |
+
## ποΈ Configuration
|
67 |
|
68 |
The following YAML configuration was used to produce this model:
|
69 |
|
|
|
100 |
|
101 |
```
|
102 |
|
103 |
+
### π Test Result
|
104 |
|
105 |
**Korean Multi Turn Conversation**
|
106 |
<a href="https://ibb.co/v40tkNj"><img src="https://i.ibb.co/hF3qVGm/Screenshot-2024-04-30-at-8-26-57-AM.png" alt="Screenshot-2024-04-30-at-8-26-57-AM" border="0"></a>
|