---
license: mit
---

Best-trained LEAD model checkpoints. The number in each file name is the number of epochs the model was trained for; `dpr.biencoder.70` gives the best performance.

Please refer to our paper and GitHub repo for more details.

Paper: [Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs](https://arxiv.org/abs/2410.06581)

GitHub repo: https://github.com/thunlp/LEAD

LEAD dataset: https://huggingface.co/datasets/JamesChengGao/LEAD
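
The file-naming convention can be read programmatically. The following is a minimal sketch, assuming every checkpoint follows the `dpr.biencoder.<epochs>` pattern (inferred from the single name given in this card). The helper `checkpoint_epoch` and the example file list are hypothetical and are not part of the LEAD repository.

```python
import re
from typing import Optional

# Assumed naming pattern, inferred from `dpr.biencoder.70`.
_CKPT_PATTERN = re.compile(r"^dpr\.biencoder\.(\d+)$")


def checkpoint_epoch(file_name: str) -> Optional[int]:
    """Return the epoch count encoded in a checkpoint file name, or None.

    Hypothetical helper for illustration only.
    """
    match = _CKPT_PATTERN.match(file_name)
    return int(match.group(1)) if match else None


# Illustrative file names; only `dpr.biencoder.70` is named in this card.
names = ["dpr.biencoder.10", "dpr.biencoder.70", "README.md"]
for name in names:
    print(name, checkpoint_epoch(name))
```

The epoch number only identifies a checkpoint. The card states that the epoch-70 checkpoint performs best, so a larger number should not be assumed to mean a better model.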