Commit 44732ee (parent 3f8ce6e) by hierholzer: Update README.md

Models initially developed in frameworks like PyTorch can be converted to GGUF format.

Here are the quantized versions that I have available:

- [x] Q2_K
- [x] Q3_K_S
- [x] Q3_K_M
- [x] Q3_K_L
- [x] Q4_K_S
- [x] Q4_K_M ~ *Recommended*
- [x] Q5_K_S ~ *Recommended*

Feel free to reach out to me if you need a specific Quantization Type that I do not currently offer.
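If you would rather download one of these quantizations directly from this repository, a minimal sketch using the `huggingface-cli` tool is shown below; the `--include` glob assumes the quantization type appears in the filenames, so check the repository's file list and adjust it as needed.

```shell
# Sketch: fetch only the Q4_K_M files from this repository.
# The --include pattern is an assumption about the file naming.
pip install -U "huggingface_hub[cli]"
huggingface-cli download hierholzer/Llama-3.3-70B-Instruct-GGUF \
  --include "*Q4_K_M*" \
  --local-dir ./Llama-3.3-70B-Instruct-GGUF
```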

### 📈All Quantization Types Possible
Below is a table of all the Quantization Types that are possible, as well as short descriptions.

| **#** | **or** | **Q#** | **:** | _Description Of Quantization Types_ |
|-------|:------:|:------:|:-----:|----------------------------------------------------------------|

By using a GGUF version of Llama-3.3-70B-Instruct, you will be able to run this LLM while using significantly fewer resources than the non-quantized version would require.
This also allows you to run this 70B model on a machine with less memory than the non-quantized version would need.
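As a rough back-of-the-envelope estimate (not an exact measurement): 70 billion parameters at 16 bits per weight come to roughly 70 × 2 bytes ≈ 140 GB for the weights alone, while a ~4-5 bit quantization such as Q4_K_M lands somewhere around 40-45 GB.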

## ⚙️️Installation
--------------------------------------------
Here are 2 different methods you can use to run the quantized versions of Llama-3.3-70B-Instruct:

git clone https://github.com/oobabooga/text-generation-webui.git

Ollama runs as a local service.
Although it technically works using a command-line interface, Ollama's best attribute is its REST API.
Being able to utilize your locally run LLMs through this API can give you almost endless possibilities!
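As a minimal sketch of what using that API can look like, assuming Ollama's default port of 11434 and that you have already pulled one of this repository's quantizations (as described further below):

```shell
# Send a one-off generation request to the local Ollama service.
# "stream": false returns the whole response as a single JSON object.
curl http://localhost:11434/api/generate -d '{
  "model": "hf.co/hierholzer/Llama-3.3-70B-Instruct-GGUF:Q4_K_M",
  "prompt": "Explain GGUF quantization in one sentence.",
  "stream": false
}'
```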

*Feel free to reach out to me if you would like to know some examples that I use this API for*

#### ☑️ How to install Ollama
```shell
https://ollama.com/download
```
Using Windows or Mac, you will then download a file and run it.
If you are using Linux, it will just provide a single command that you need to run in your terminal window.
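For reference, the single Linux command published on that page at the time of writing is shown below; treat it as a sketch and use whatever the download page currently shows.

```shell
# One-line Linux install, as published on ollama.com at the time of writing.
curl -fsSL https://ollama.com/install.sh | sh
```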
*That's about it for installing Ollama*

#### ✅Using Llama-3.3-70B-Instruct-GGUF with Ollama
Ollama does have a Model Library where you can download models:
```shell
https://ollama.com/library
```
This Model Library offers many different LLM versions that you can use.
However, at the time of writing this, there is no version of Llama-3.3-Instruct offered in the Ollama library.

If you would like to use Llama-3.3-Instruct (70B), do the following:

| # | Running the 70B quantized version of Llama 3.3-Instruct with Ollama |
|----|----------------------------------------------------------------------------------------------|

ollama run hf.co/hierholzer/Llama-3.3-70B-Instruct-GGUF:Q4_K_M

*Replace Q4_K_M with whatever version you would like to use from this repository.*

| # | Running the 70B quantized version of Llama 3.3-Instruct with Ollama - *continued* |
|----|-----------------------------------------------------------------------------------|
| 3. | This will download & run the model. It will also be saved for future use. |
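As a small sketch of how to confirm the saved model: `ollama list` shows everything cached locally, and later `ollama run` calls reuse that cached copy instead of downloading again.

```shell
# List locally saved models; the hf.co/... tag should appear here
# after the first pull completes.
ollama list
```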

-------------------------------------------------