Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,7 @@ tags:
|
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
-
Following up on [tomasmcm/sky-t1-coder-32b-flash](https://huggingface.co/tomasmcm/sky-t1-coder-32b-flash), this experiment tries to merge 2 reasoning models based on Qwen 32B with a Coder model.
|
18 |
|
19 |
|
20 |
## Merge Details
|
|
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
+
Following up on [tomasmcm/sky-t1-coder-32b-flash](https://huggingface.co/tomasmcm/sky-t1-coder-32b-flash), this experiment tries to merge 2 reasoning models based on Qwen 32B with a Coder model. But it seems to have caused the model to loose it's thinking abilities, even when adding `<think>` to the prompt.
|
18 |
|
19 |
|
20 |
## Merge Details
|