Hwanjun commited on
Commit
b555bb8
·
verified ·
1 Parent(s): c7b1e81

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -1
README.md CHANGED
@@ -15,10 +15,14 @@ tags:
15
  </div>
16
 
17
  Are you looking for a more robust and reliable generation model for RAG system?
 
18
  Here is a Ext2Gen-8B-R2 model that effectively mitigates hallucinations caused by retrieval noise and information overload.
19
 
 
 
20
  ### What is Ext2Gen-8B-R2?
21
  Ext2Gen-8B-R2 is built upon Llama3.2-8B-Instruct, incorporating preference-aligned fine-tuning through pairwise feedback learning.
 
22
  This training strategy enables the model to:
23
  - Extract highly relevant sentences from retrieved chunks before generating an answer.
24
  - Filter out irrelevant or misleading information, reducing hallucinations.
@@ -30,8 +34,19 @@ Standard RAG models often struggle due to:
30
  - Information Overload – The presence of irrelevant chunks can distract the model, leading to errors or hallucinations.
31
  - Lack of Alignment – Most generation models are not explicitly trained to prioritize relevant content over noise.
32
 
 
 
 
 
 
33
  ### Performance Benchmark
34
- Our extensive evaluations demonstrate that Ext2Gen-8B-R2 significantly enhances robustness in RAG systems:
 
 
 
 
 
 
35
 
36
 
37
 
 
15
  </div>
16
 
17
  Are you looking for a more robust and reliable generation model for RAG system?
18
+
19
  Here is a Ext2Gen-8B-R2 model that effectively mitigates hallucinations caused by retrieval noise and information overload.
20
 
21
+ See the details in our paper [Link](https://arxiv.org/pdf/2503.04789)
22
+
23
  ### What is Ext2Gen-8B-R2?
24
  Ext2Gen-8B-R2 is built upon Llama3.2-8B-Instruct, incorporating preference-aligned fine-tuning through pairwise feedback learning.
25
+
26
  This training strategy enables the model to:
27
  - Extract highly relevant sentences from retrieved chunks before generating an answer.
28
  - Filter out irrelevant or misleading information, reducing hallucinations.
 
34
  - Information Overload – The presence of irrelevant chunks can distract the model, leading to errors or hallucinations.
35
  - Lack of Alignment – Most generation models are not explicitly trained to prioritize relevant content over noise.
36
 
37
+ ### Prompt
38
+
39
+ TBD
40
+
41
+
42
  ### Performance Benchmark
43
+ Our evaluations demonstrate that Ext2Gen-8B-R2 significantly enhances robustness in RAG systems:
44
+ * We conduct a QA task using RAG Systems on NQ, MS-MARCO, HotpotQA datasets.
45
+ * The difference is the generation backbone: Llama3.1-8B-Instruct vs. Ext2Gen-8B-R2
46
+
47
+ See the results in the Figure below:
48
+
49
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63c9da8d5fdc575773c84816/4mbreGv3QNxKOY8HzCLxx.png)
50
 
51
 
52