Update README.md
README.md (CHANGED)
````diff
@@ -42,6 +42,11 @@ Standard RAG models often struggle due to:
 - chunk_list: the list of retrieved chunks, e.g., ["chunk 1", "chunk 2", "chunk 3"]
 
 ```python
+
+def prepare_sample_text(prompt):
+    row_json = [{"role": "user", "content": prompt}]
+    return tokenizer.apply_chat_template(row_json, tokenize=False)
+
 def format_prompt_template(query, chunk_list):
 
     chunk_list = ['[Chunk ID: ' + str(idx + 1) + '] ' + chunk_text for idx, chunk_text in enumerate(chunk_list)]
````
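The added `prepare_sample_text` wraps a finished prompt in the model's chat template via the tokenizer, while `format_prompt_template` tags each retrieved chunk with a stable ID so the model can refer to chunks by number. A standalone illustration of that labeling step (illustrative only; the example chunks are the placeholders from the `chunk_list` description above):

```python
# Label each retrieved chunk with its 1-based ID, as in the snippet above.
chunk_list = ["chunk 1", "chunk 2", "chunk 3"]
labeled = ['[Chunk ID: ' + str(idx + 1) + '] ' + chunk_text
           for idx, chunk_text in enumerate(chunk_list)]
print(labeled)
# ['[Chunk ID: 1] chunk 1', '[Chunk ID: 2] chunk 2', '[Chunk ID: 3] chunk 3']
```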
````diff
@@ -87,6 +92,9 @@ Answer: Your Answer
 
     return prompt.strip()
 
+
+prompt = format_prompt_template(query, noisy_chunks)
+prompt = prepare_sample_text(prompt)
 ```
 
````
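Note that `prepare_sample_text` relies on a `tokenizer` being in scope, so the tokenizer has to be loaded before the two calls added above. A minimal sketch of that wiring, assuming the model is pulled from the Hugging Face Hub (the repo id below is a placeholder, not taken from this diff):

```python
from transformers import AutoTokenizer

# Placeholder repo id -- substitute the actual Ext2Gen-8B-R2 checkpoint.
tokenizer = AutoTokenizer.from_pretrained("your-org/Ext2Gen-8B-R2")

query = "What is the estimated number of deaths at Chelmno?"  # example QA from the README
noisy_chunks = ["chunk 1", "chunk 2", "chunk 3"]  # retrieved chunks, relevant or not

prompt = format_prompt_template(query, noisy_chunks)  # defined in the README above
prompt = prepare_sample_text(prompt)                  # wrap in the chat template
```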
````diff
@@ -103,6 +111,14 @@ Answer: The estimated number of deaths at Chelmno is 150-300,000, mainly Jews.
 
 The number of extracted sentences varies depending on the QA.
 
+### Generation Parameters
+
+```python
+max_new_tokens=1024,  # or 2048
+do_sample=True,
+temperature=0.8,
+top_p=0.9,
+```
 
 ### Performance Benchmark
 Our evaluations demonstrate that Ext2Gen-8B-R2 significantly enhances robustness in RAG systems:
````
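For completeness, a hedged sketch of running generation with the parameters listed above, continuing from the tokenizer sketch earlier. Loading the model this way is an assumption, and splitting on `Answer:` is just one plausible way to pull out the final answer given the `Answer: Your Answer` line in the template:

```python
import torch
from transformers import AutoModelForCausalLM

# Placeholder repo id, matching the tokenizer sketch above.
model = AutoModelForCausalLM.from_pretrained(
    "your-org/Ext2Gen-8B-R2",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=1024,  # or 2048
    do_sample=True,
    temperature=0.8,
    top_p=0.9,
)
completion = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
answer = completion.split("Answer:")[-1].strip()  # assumes the template's output layout
print(answer)
```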