Envoid commited on
Commit
379932d
1 Parent(s): 7fc7fb8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -12,4 +12,5 @@ It responds well to Alpaca instruct style prompt formatting.
12
  It can be a little rude at times and doesn't have Dendrite's ego and thirst for philosophical discussion but I feel that it's overall it's a much better general purpose model.
13
  It does occasionally output grammatical errors during RP so might need a few more epochs to better fit the training data.
14
  If you are role playing using the SillyTavern+SimpleProxy stack it does have a tendency to run away with a scene when using the verbose.mjs prompt format. The singleline.mjs format sometimes remedies this issue however it also causes some characters to give very short, dull replies. So achieving a balance might require a complete new custom prompt format.
15
- ## use_cache was originally set to false when uploaded this has now been remedied. recommended edit or redownload config.
 
 
12
  It can be a little rude at times and doesn't have Dendrite's ego and thirst for philosophical discussion but I feel that it's overall it's a much better general purpose model.
13
  It does occasionally output grammatical errors during RP so might need a few more epochs to better fit the training data.
14
  If you are role playing using the SillyTavern+SimpleProxy stack it does have a tendency to run away with a scene when using the verbose.mjs prompt format. The singleline.mjs format sometimes remedies this issue however it also causes some characters to give very short, dull replies. So achieving a balance might require a complete new custom prompt format.
15
+ ## use_cache was originally set to false when uploaded this has now been remedied. recommended edit or redownload config.
16
+ ## I have been asked about GPTQ for this model unfortunately there seems to be some weird vocabulary mismatch that causes GPTQ to corrupt the model. So the only way to run it in 4bit at the moment is to load the FP16 model in 4bit via transformers.