Committed by Pedram Rostami
Commit 49513bb
1 Parent(s): 9aad44b
Update README.md
README.md CHANGED
@@ -73,7 +73,7 @@ input_tokens = input_tokens.to(device)
 generate_ids = model.generate(**input_tokens, max_new_tokens=512, do_sample=False, repetition_penalty=1.1)
 model_output = tokenizer.batch_decode(generate_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)[0]
 
-print(model_output[len(model_input)
+print(model_output[len(model_input):])
 ```
 
 ### How to Quantize the Model
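The changed line prints only the text that follows the prompt by slicing the decoded string at `len(model_input)`. The sketch below illustrates that slicing on plain strings, without loading a model. The example prompt, the example completion, and the `strip_prompt` helper are hypothetical and not part of the README; the `startswith` guard is an added assumption for cases where decoding does not reproduce the prompt character for character.

```python
# Hypothetical stand-ins for the README's variables; no model is loaded here.
model_input = "Question: What is 2 + 2?\nAnswer:"
# Assumption: the decoded output begins with the prompt, followed by the completion.
model_output = model_input + " 4"


def strip_prompt(decoded: str, prompt: str) -> str:
    """Return the text after the prompt, or the full text if the prompt is not a prefix."""
    if decoded.startswith(prompt):
        # Same slice as in the README: drop the first len(prompt) characters.
        return decoded[len(prompt):]
    # Fallback (assumption): decoding altered the prompt, so slicing by length is unsafe.
    return decoded


completion = strip_prompt(model_output, model_input)
print(completion)
```

If the tokenizer does not round-trip the prompt exactly, a possible alternative is to slice at the token level before decoding, for example `generate_ids[:, input_tokens["input_ids"].shape[1]:]`, assuming a decoder-only model whose `generate` output starts with the prompt tokens.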