deepseek-ai
/

ESFT-vanilla-lite

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ZihanWang314 commited on Jul 5, 2024

Commit

3399ce9

·

verified ·

1 Parent(s): 2d872f6

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -1,3 +1,5 @@
 The vanilla model used in our Expert-Specialized Fine-Tuning (ESFT) research paper: https://arxiv.org/abs/2407.01906.
 For the customized models used in this paper, please refer to https://huggingface.co/deepseek-ai/ESFT-{gate, token}-{task_name}-lite.

 The vanilla model used in our Expert-Specialized Fine-Tuning (ESFT) research paper: https://arxiv.org/abs/2407.01906.
+To use this model and specialized expert sets, please refer to the scripts at https://github.com/deepseek-ai/ESFT.
 For the customized models used in this paper, please refer to https://huggingface.co/deepseek-ai/ESFT-{gate, token}-{task_name}-lite.