Update README.md
README.md
CHANGED
@@ -202,10 +202,6 @@ Each argument was constructed by selecting two random propositions (from a list
 
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
-#### Preprocessing
-
-[More Information Needed]
-We converted the data to the Alpaca chat format before feeding it to the model.
 
 #### Training
 
@@ -213,7 +209,7 @@ We used unsloth for memory reduced sped up training.
 
 We trained for one epoch.
 
-Less than
+Less than 3.5 GB of VRAM were used for training, and it took 3 hours.
 
 ## Evaluation
 