|
--- |
|
license: llama3.1 |
|
tags: |
|
- not-for-all-audiences |
|
--- |
|
<p align="center"><font size="4"><b>Auto-loaded prompt format will not work with these GGUFs, will requant but for now you can just switch in ooba, etc</b></font></p> |
|
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/BMdwgJ6cHZWbiGL48Q-Wq.png"/><font size="6"><b>Techne-RP-8b</b></font></p> |
|
<p align="center"><font size="4"><b>Trained with Llama 3 prompt formatting, Alpaca works too</b></font></p> |
|
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/wNVjJ1u0kLR0cL-9Bwk4P.png"/></p> |
|
|
|
----- |
|
|
|
<p align="center"><font size="5"> <b>Assistant Example @ q5_k_m</b> </font></p> |
|
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/dN45v5YHdIVyOacRx4xSc.png"/></p> |
|
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/8qE0ikdtibgFtMZ-SVH1P.png"/></p> |
|
|
|
----- |
|
|
|
<p align="center"><font size="5"> <b>NSFW Writing Example @ q5_k_m</b> </font></p> |
|
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/exle9vh1IFoKnAcIL1D64.png"/></p> |
|
|
|
----- |
|
|
|
<p align="center"><font size="5"> <b>Training Methodology</b> </font></p> |
|
|
|
athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained in the order below: |
|
|
|
## SFT |
|
- Doctor-Shotgun/no-robots-sharegpt |
|
- grimulkan/LimaRP-augmented |
|
- Inv/c2-logs-cleaned-deslopped |
|
|
|
## DPO |
|
- jondurbin/truthy-dpo-v0.1 |
|
- Undi95/Weyaxi-humanish-dpo-project-noemoji |
|
- athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW |