Triangle104/Pygmalion-3-12B-Q5_K_S-GGUF

Triangle104 committed 9 days ago

Commit fe01368 · verified · 1 Parent(s): 0351d0b

Update README.md

Files changed (1)

README.md +75 -0

@@ -12,6 +12,81 @@ tags:
 This model was converted to GGUF format from [`PygmalionAI/Pygmalion-3-12B`](https://huggingface.co/PygmalionAI/Pygmalion-3-12B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/PygmalionAI/Pygmalion-3-12B) for more details on the model.
+---
+Dataset
+-
+We've gathered a large collection of instruction and roleplay data totaling hundreds of millions of tokens, including our PIPPA dataset and data from roleplaying forums.
+Limitations and biases
+-
+The intended use-case for this model is fictional writing for entertainment purposes. Any other sort of usage is out of scope.
+As such, it was not fine-tuned to be safe and
+harmless: the base model and this fine-tune have been trained on data
+known to contain profanity and texts that are lewd or otherwise
+offensive. It may produce socially unacceptable or undesirable text,
+even if the prompt itself does not include anything explicitly
+offensive. Outputs might often be factually wrong or misleading.
+Training Specifications
+-
+We trained our model as a rank-32 LoRA adapter for one epoch over
+our data using 8x NVIDIA A40 GPUs. For this run, we used a learning
+rate of 2e-4 and a total batch size of 24 across all GPUs. A cosine
+learning rate scheduler was used with a 100-step warmup. DeepSpeed ZeRO
+was used to reduce memory usage.
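The schedule described above can be sketched in a few lines. This is only an illustration, not the authors' training code: the model card gives the peak learning rate (2e-4), the warmup length (100 steps), the GPU count (8), and the total batch size (24), but it does not state the total number of steps, the warmup shape, or the final learning rate. `TOTAL_STEPS`, the linear warmup, and the decay to zero are therefore assumptions, as is the function name `lr_at`. The per-GPU batch size of 3 also assumes no gradient accumulation.

```python
import math

# Values stated in the model card.
PEAK_LR = 2e-4
WARMUP_STEPS = 100
NUM_GPUS = 8
TOTAL_BATCH_SIZE = 24

# Assumption: the card does not give the total step count; 1000 is a placeholder.
TOTAL_STEPS = 1000


def lr_at(step, total_steps=TOTAL_STEPS, peak_lr=PEAK_LR, warmup_steps=WARMUP_STEPS):
    """Learning rate at a given optimizer step (hypothetical helper).

    Assumes a linear warmup from 0 to peak_lr, followed by a cosine
    decay from peak_lr to 0 over the remaining steps.
    """
    if step < warmup_steps:
        # Linear warmup phase.
        return peak_lr * step / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(progress, 1.0)
    # Cosine decay phase.
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


# Per-GPU batch size, assuming the total is split evenly with no gradient accumulation.
per_gpu_batch_size = TOTAL_BATCH_SIZE // NUM_GPUS

if __name__ == "__main__":
    for step in (0, 50, 100, 550, 1000):
        print(f"step {step:4d}: lr = {lr_at(step):.2e}")
    print("per-GPU batch size:", per_gpu_batch_size)
```

Under these assumptions the rate climbs linearly to 2e-4 by step 100 and then follows a half cosine down toward zero at the final step.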
+Acknowledgements
+-
+This project could not have been done without the compute support of Hive Digital Technologies and the Axolotl training software.
+We'd also like to extend our sincere thanks to lemonilia for their wonderful help in compiling roleplay forum data.
+And most of all, we dedicate this model to our great community,
+who've stuck with us through everything until now. Sincerely, thank you
+so much. We hope you enjoy our work to the fullest and we promise more
+is on the way soon.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)