Update README.md
# DeepRetrieval

## Overview

DeepRetrieval uses reinforcement learning (RL) to train large language models (LLMs) to generate search queries without supervised data. Instead of relying on expensive human-annotated or distilled reference queries, the model learns through direct trial and error, with retrieval metrics serving as the reward.
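
As a rough illustration of "retrieval metrics as rewards", the sketch below scores one generated query by recall@k over the documents a search engine returns for it. The choice of recall@k, the function name `recall_at_k`, and the toy document IDs are assumptions made for this example; they are not taken from the DeepRetrieval codebase.

```python
from typing import Iterable, Sequence


def recall_at_k(retrieved: Sequence[str], relevant: Iterable[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    relevant_set = set(relevant)
    if not relevant_set:
        # No ground truth available: treat the reward as zero (an assumption).
        return 0.0
    hits = relevant_set.intersection(retrieved[:k])
    return len(hits) / len(relevant_set)


# Toy example: ranked results returned for one generated query (hypothetical IDs).
retrieved_ids = ["d3", "d7", "d1", "d9"]
relevant_ids = ["d1", "d2", "d3", "d4"]

# Two of the four relevant documents (d3, d1) are in the top 3.
reward = recall_at_k(retrieved_ids, relevant_ids, k=3)
print(reward)  # 0.5
```

In an RL loop, a scalar like this would be fed back to the policy after each generated query, so no reference query is needed, only relevance labels for the retrieved documents.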

## Key Features

- **No Supervision Required**: Eliminates the need for expensive human-annotated or distilled reference queries
- **RL-Based Framework**: Uses reinforcement learning to optimize query generation directly for retrieval performance
- **Reasoning-Enhanced Generation**: Generates explicit reasoning in a structured format before formulating the query
- **State-of-the-Art Performance**: Achieves strong results across diverse retrieval tasks, including literature search, evidence-seeking retrieval, classic IR, and SQL database search
- **Parameter Efficiency**: With just 3B parameters, outperforms much larger models such as GPT-4o and Claude-3.5-Sonnet
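
The reasoning-before-query structure mentioned above could be consumed roughly as follows. This is a sketch only: it assumes the model wraps its reasoning in `<think>` tags and the final query in `<answer>` tags, and both the tag names and the helper `extract_query` are illustrative assumptions rather than details confirmed by this README.

```python
import re
from typing import Optional

# Assumed output layout: <think>reasoning</think> followed by <answer>query</answer>.
STRUCTURED_RE = re.compile(
    r"<think>.*?</think>\s*<answer>(.*?)</answer>", re.DOTALL
)


def extract_query(completion: str) -> Optional[str]:
    """Return the query from a well-formed completion, or None if malformed."""
    match = STRUCTURED_RE.search(completion)
    if match is None:
        return None
    query = match.group(1).strip()
    return query or None  # an empty answer counts as malformed


completion = (
    "<think>The user wants trials on statins in older adults.</think>\n"
    "<answer>(statin OR statins) AND (elderly OR aged)</answer>"
)
print(extract_query(completion))  # (statin OR statins) AND (elderly OR aged)
print(extract_query("statins elderly"))  # None
```

Returning `None` for malformed output lets a training loop decide separately how to penalize completions that skip the reasoning step; that policy is a design choice left open here.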

## Performance Highlights

- **Literature Search**: More than doubles recall on PubMed (65.07% vs. the previous SOTA of 24.68%) and nearly doubles it on ClinicalTrials.gov (63.18% vs. the previous SOTA of 32.11%)
- **Evidence-Seeking Retrieval**: Matches industry-leading LLMs on NQ and TriviaQA, and significantly outperforms them on SQuAD
- **Classic IR**: Shows superior performance across diverse retrieval benchmarks
- **SQL Database Search**: Excels in text-to-SQL generation for database search

## About

DeepRetrieval was developed by researchers from the University of Illinois Urbana-Champaign. For more information, visit the [GitHub repository](https://github.com/pat-jj/DeepRetrieval).