ZihanWang314
committed on
Update README.md
README.md
@@ -1,3 +1,5 @@
 The vanilla model used in our Expert-Specialized Fine-Tuning (ESFT) research paper: https://arxiv.org/abs/2407.01906.
 
+To use this model and specialized expert sets, please refer to the scripts at https://github.com/deepseek-ai/ESFT.
+
 For the customized models used in this paper, please refer to https://huggingface.co/deepseek-ai/ESFT-{gate, token}-{task_name}-lite.
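
The `ESFT-{gate, token}-{task_name}-lite` pattern in the README can be read as a template with two placeholders. The sketch below expands it into concrete repository IDs and URLs. It assumes that `{gate, token}` means "one of `gate` or `token`" and that `{task_name}` is substituted verbatim. The helper names and the task name `my_task` are hypothetical and used only for illustration; the README does not list the real task names.

```python
from itertools import product

ORG = "deepseek-ai"
# The two alternatives named in the README's {gate, token} placeholder.
SELECTION_METHODS = ("gate", "token")


def esft_repo_ids(task_names):
    """Expand ESFT-{gate, token}-{task_name}-lite for each given task name.

    Hypothetical helper: task names are supplied by the caller because
    the README does not enumerate them.
    """
    return [
        f"{ORG}/ESFT-{method}-{task}-lite"
        for task, method in product(task_names, SELECTION_METHODS)
    ]


def esft_url(repo_id):
    """Build the Hugging Face page URL for a repository ID."""
    return f"https://huggingface.co/{repo_id}"


if __name__ == "__main__":
    # "my_task" is a placeholder, not a real ESFT task name.
    for repo_id in esft_repo_ids(["my_task"]):
        print(esft_url(repo_id))
```

Whether a given expanded name actually exists on the Hub depends on the tasks covered in the paper, so check each URL before relying on it.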