DeBERTa-ST-AllLayers-v3.1bis / config_sentence_transformers.json
bobox's picture
KL divergence loss layers selfdistill....Multi step multi task training.
869170b verified
raw
history blame contribute delete
195 Bytes
{
"__version__": {
"sentence_transformers": "3.0.1",
"transformers": "4.42.3",
"pytorch": "2.1.2"
},
"prompts": {},
"default_prompt_name": null,
"similarity_fn_name": null
}