fullstack commited on
Commit
313af0d
1 Parent(s): 0543aa3

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +12 -4
README.md CHANGED
@@ -2,6 +2,9 @@
2
 
3
  A LoRA adapter for unsloth/gemma-2b that determines content indexing suitability using chain-of-thought reasoning.
4
 
 
 
 
5
  ## Technical Specifications
6
 
7
  ### Base Model
@@ -19,16 +22,21 @@ Input XML structure:
19
  <instruction>Determine true or false if the following content is suitable and should be indexed.</instruction>
20
  <suitable>
21
  <content>{input_text}</content>
22
- </suitable>
23
  ```
24
 
25
  Output XML structure:
26
  ```xml
27
- <thinking>{reasoning_process}</thinking>
28
- <category>{content_type}</category>
29
- <should_index>{true|false}</should_index>
 
 
30
  ```
31
 
 
 
 
 
32
  ## Deployment
33
 
34
  ### VLLM Server Setup
 
2
 
3
  A LoRA adapter for unsloth/gemma-2b that determines content indexing suitability using chain-of-thought reasoning.
4
 
5
+ Note: This is used in a pipeline to determine if a context should be indexed or not.
6
+
7
+
8
  ## Technical Specifications
9
 
10
  ### Base Model
 
22
  <instruction>Determine true or false if the following content is suitable and should be indexed.</instruction>
23
  <suitable>
24
  <content>{input_text}</content>
 
25
  ```
26
 
27
  Output XML structure:
28
  ```xml
29
+ <thinking>{reasoning_process}</thinking>
30
+ <category>{content_type}</category>
31
+ <should_index>{true|false}</should_index>
32
+ </suitable>
33
+
34
  ```
35
 
36
+ The model then expects an indefinite list of following <suitable> ... </suitable> so you can few shot it to correct a mistake or enhance the results.
37
+
38
+ Your stop token should be `</suitable>` to ensure the model stops generating after the output.
39
+
40
  ## Deployment
41
 
42
  ### VLLM Server Setup