ZihanWang314 committed
Commit 3399ce9 · verified · 1 Parent(s): 2d872f6

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -1,3 +1,5 @@
 The vanilla model used in our Expert-Specialized Fine-Tuning (ESFT) research paper: https://arxiv.org/abs/2407.01906.
 
+To use this model and specialized expert sets, please refer to the scripts at https://github.com/deepseek-ai/ESFT.
+
 For the customized models used in this paper, please refer to https://huggingface.co/deepseek-ai/ESFT-{gate, token}-{task_name}-lite.
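
Note on the repository naming pattern: the README gives the customized model location as a template, `ESFT-{gate, token}-{task_name}-lite`. One plausible reading is that each model name combines a selection method (`gate` or `token`) with a task name. The Python sketch below expands the template under that assumption. The function name `esft_repo_urls` and the task names `task_a` and `task_b` are hypothetical placeholders, not names taken from the README or the ESFT repository; substitute the real task names listed in the paper or on the Hub.

```python
from itertools import product

# Base of the URL template quoted in the README.
BASE_URL = "https://huggingface.co/deepseek-ai"


def esft_repo_urls(task_names, methods=("gate", "token")):
    """Expand ESFT-{gate, token}-{task_name}-lite into concrete URLs.

    Assumes the braces in the README mean "one of gate/token" and
    "one task name"; this is an interpretation, not a documented API.
    """
    return [
        f"{BASE_URL}/ESFT-{method}-{task}-lite"
        for method, task in product(methods, task_names)
    ]


if __name__ == "__main__":
    # "task_a" and "task_b" are placeholders, not real ESFT task names.
    for url in esft_repo_urls(["task_a", "task_b"]):
        print(url)
```

The function only builds strings; it does not check that a given repository exists on the Hub.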