Triangle104
/

Dumpling-Mistral-Nemo-8B-Q4_K_S-GGUF

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on 17 days ago

Commit

a77b045

·

verified ·

1 Parent(s): 648df0a

Update README.md

Files changed (1) hide show

README.md +38 -0

README.md CHANGED Viewed

@@ -10,6 +10,44 @@ base_model: nbeerbower/Dumpling-Mistral-Nemo-8B
 This model was converted to GGUF format from [`nbeerbower/Dumpling-Mistral-Nemo-8B`](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`nbeerbower/Dumpling-Mistral-Nemo-8B`](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) for more details on the model.
+---
+🧪 Experimental
+An attempt to recover intelligence with a quick train, results are meh
+Dumpling-Mistral-Nemo-8B
+nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:
+-nbeerbower/GreatFirewall-DPO
+-nbeerbower/Schule-DPO
+-nbeerbower/Purpura-DPO
+-nbeerbower/Arkhaios-DPO
+-jondurbin/truthy-dpo-v0.1
+-antiven0m/physical-reasoning-dpo
+-flammenai/Date-DPO-NoAsterisks
+-flammenai/Prude-Phi3-DPO
+-Atsunori/HelpSteer2-DPO (1,000 samples)
+-jondurbin/gutenberg-dpo-v0.1
+-nbeerbower/gutenberg2-dpo
+-nbeerbower/gutenberg-moderne-dpo.
+Method
+---
+QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)