ZihanWang314 commited on
Commit
c7873fd
·
verified ·
1 Parent(s): e0522c3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ The vanilla model used in our Expert-Specialized Fine-Tuning (ESFT) research paper: https://arxiv.org/abs/2407.01906.
2
+
3
+ For the customized models used in this paper, please refer to https://huggingface.co/deepseek-ai/ESFT-{gate, token}-{task_name}-lite