UsernameJustAnother
commited on
Commit
•
585995c
1
Parent(s):
ff69660
Update README.md
Browse files
README.md
CHANGED
@@ -45,7 +45,7 @@ The aim here is for a solid RP/storywriting model that will fit in 16GB of VRAM
|
|
45 |
|
46 |
I pulled v7 because I honestly don't think it's as good as v6, and don't want folks to get the wrong idea that it's better just because the version number is higher. Besides, nothing good ever fires on all _seven_ cylinders.
|
47 |
|
48 |
-
Props again to [Daniel](https://huggingface.co/danielhanchen) and [Unsloth](https://huggingface.co/unsloth) for writing magic that lets me train this on a single A100 with variable (wildly variable) context length.
|
49 |
|
50 |
Here's what the train/eval loss looked like:
|
51 |
|
|
|
45 |
|
46 |
I pulled v7 because I honestly don't think it's as good as v6, and don't want folks to get the wrong idea that it's better just because the version number is higher. Besides, nothing good ever fires on all _seven_ cylinders.
|
47 |
|
48 |
+
Props again to [Daniel](https://huggingface.co/danielhanchen) and [Unsloth](https://huggingface.co/unsloth) for writing magic that lets me train this on a single A100 with variable (wildly variable) context length. [The docker image I used to run Unsloth on runpod is here](https://hub.docker.com/r/usernamejustanother/runpod_unsloth).
|
49 |
|
50 |
Here's what the train/eval loss looked like:
|
51 |
|