Update README.md
README.md
CHANGED
@@ -15,10 +15,14 @@ tags:
 </div>
 
 Are you looking for a more robust and reliable generation model for RAG system?
+
 Here is a Ext2Gen-8B-R2 model that effectively mitigates hallucinations caused by retrieval noise and information overload.
 
+See the details in our paper [Link](https://arxiv.org/pdf/2503.04789)
+
 ### What is Ext2Gen-8B-R2?
 Ext2Gen-8B-R2 is built upon Llama3.2-8B-Instruct, incorporating preference-aligned fine-tuning through pairwise feedback learning.
+
 This training strategy enables the model to:
 - Extract highly relevant sentences from retrieved chunks before generating an answer.
 - Filter out irrelevant or misleading information, reducing hallucinations.
@@ -30,8 +34,19 @@ Standard RAG models often struggle due to:
 - Information Overload – The presence of irrelevant chunks can distract the model, leading to errors or hallucinations.
 - Lack of Alignment – Most generation models are not explicitly trained to prioritize relevant content over noise.
 
+### Prompt
+
+TBD
+
+
 ### Performance Benchmark
-Our
+Our evaluations demonstrate that Ext2Gen-8B-R2 significantly enhances robustness in RAG systems:
+* We conduct a QA task using RAG Systems on NQ, MS-MARCO, HotpotQA datasets.
+* The difference is the generation backbone: Llama3.1-8B-Instruct vs. Ext2Gen-8B-R2
+
+See the results in the Figure below:
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63c9da8d5fdc575773c84816/cPHxpWyDxnDTjGZ7fTyFS.png)
 
 
 