ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3

Safetensors

qwen2

OwenArli committed on Nov 12

Commit 98a8bbb • 1 Parent(s): a7387ef

Update README.md

Files changed (1)

README.md +24 -20

@@ -1,16 +1,17 @@
 ---
 license: apache-2.0
 ---
-Qwen2.5-32B-ArliAI-RPMax-v1.3
 =====================================
-RPMax v1 Series Overview
-v1.1 = 2B | 3.8B | 8B | 9B | 12B | 20B | 22B | 70B
-v1.2 = 8B | 12B | 70B
-v1.3 = 32B
 RPMax is a series of models that are trained on a diverse set of curated creative writing and RP datasets with a focus on variety and deduplication. This model is designed to be highly creative and non-repetitive by making sure no two entries in the dataset have repeated characters or situations, which makes sure the model does not latch on to a certain personality and be capable of understanding and acting appropriately to any characters or situations.
@@ -19,7 +20,8 @@ Many RPMax users mentioned that these models does not feel like any other RP mod
 You can access the model at https://arliai.com and we also have a models ranking page at https://www.arliai.com/models-ranking
 Ask questions in our new Discord Server https://discord.com/invite/t75KbPgwhk or on our subreddit https://www.reddit.com/r/ArliAI/
-Model Description
 Qwen2.5-32B-ArliAI-RPMax-v1.3 is a variant made from the Qwen2.5-32B-Instruct model.
@@ -28,31 +30,32 @@ Let us know what you think of the model! The different parameter versions are ba
 v1.3 updated models are trained with updated software and configs such as the updated transformers library that fixes the gradient checkpointing bug which should help the model learn better.
 This version also uses RSLORA+ for training which helps the model learn even better.
-Specs
-    Context Length: 128K
-    Parameters: 32B
-Training Details
-    Sequence Length: 8192
-    Training Duration: Approximately 4 days on 2x3090Ti
-    Epochs: 1 epoch training for minimized repetition sickness
-    LORA: 64-rank 64-alpha, resulting in ~2% trainable weights
-    Learning Rate: 0.00001
-    Gradient accumulation: Very low 32 for better learning.
-Quantization
 The model is available in quantized formats:
-    FP16: https://huggingface.co/ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3
-    GGUF: https://huggingface.co/ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3-GGUF
-Suggested Prompt Format
 ChatML Chat Format
 <|im_start|>system
 Provide some context and/or instructions to the model.
 <|im_end|>
@@ -60,3 +63,4 @@ Provide some context and/or instructions to the model.
 The user’s message goes here
 <|im_end|>
 <|im_start|>assistant

 ---
 license: apache-2.0
 ---
+# Qwen2.5-32B-ArliAI-RPMax-v1.3
 =====================================
+## RPMax Series Overview
+v1.1 = [2B](https://huggingface.co/ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1) | [3.8B](https://huggingface.co/ArliAI/Phi-3.5-mini-3.8B-ArliAI-RPMax-v1.1) | [8B](https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1) | [9B](https://huggingface.co/ArliAI/Gemma-2-9B-ArliAI-RPMax-v1.1) | [12B](https://huggingface.co/ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1) | [20B](https://huggingface.co/ArliAI/InternLM2_5-20B-ArliAI-RPMax-v1.1) | [22B](https://huggingface.co/ArliAI/Mistral-Small-22B-ArliAI-RPMax-v1.1) | [70B](https://huggingface.co/ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.1)
+v1.2 = [8B](https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2) | [12B](https://huggingface.co/ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.2) | [70B](https://huggingface.co/ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.2)
+v1.3 = [32B](https://huggingface.co/ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3)
 RPMax is a series of models that are trained on a diverse set of curated creative writing and RP datasets with a focus on variety and deduplication. This model is designed to be highly creative and non-repetitive by making sure no two entries in the dataset have repeated characters or situations, which makes sure the model does not latch on to a certain personality and be capable of understanding and acting appropriately to any characters or situations.
 You can access the model at https://arliai.com and we also have a models ranking page at https://www.arliai.com/models-ranking
 Ask questions in our new Discord Server https://discord.com/invite/t75KbPgwhk or on our subreddit https://www.reddit.com/r/ArliAI/
+## Model Description
 Qwen2.5-32B-ArliAI-RPMax-v1.3 is a variant made from the Qwen2.5-32B-Instruct model.
 v1.3 updated models are trained with updated software and configs such as the updated transformers library that fixes the gradient checkpointing bug which should help the model learn better.
 This version also uses RSLORA+ for training which helps the model learn even better.
+### Specs
+* **Context Length**: 128K
+* **Parameters**: 32B
+### Training Details
+* **Sequence Length**: 8192
+* **Training Duration**: Approximately 3 days on 2x3090Ti
+* **Epochs**: 1 epoch training for minimized repetition sickness
+* **RS-QLORA+**: 64-rank 64-alpha, resulting in ~2% trainable weights
+* **Learning Rate**: 0.00001
+* **Gradient accumulation**: Very low 32 for better learning.
+## Quantization
 The model is available in quantized formats:
+* **FP16**: https://huggingface.co/ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3
+* **GGUF**: https://huggingface.co/ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3-GGUF
+## Suggested Prompt Format
 ChatML Chat Format
+```
 <|im_start|>system
 Provide some context and/or instructions to the model.
 <|im_end|>
 The user’s message goes here
 <|im_end|>
 <|im_start|>assistant
+```
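
The "Suggested Prompt Format" section of the README names ChatML. As a rough illustration of how such a prompt could be assembled by hand, here is a minimal Python sketch. The helper name `build_chatml_prompt`, its `add_generation_prompt` parameter, and the exact newline placement around `<|im_end|>` are assumptions made for this example and are not taken from the README; when the tokenizer ships a chat template, that template is the safer source of truth.

```python
from typing import Dict, List

# ChatML delimiters as they appear in the README's prompt format section.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_chatml_prompt(
    messages: List[Dict[str, str]],
    add_generation_prompt: bool = True,
) -> str:
    """Join role/content messages into one ChatML-formatted string.

    Hypothetical helper for illustration only. Each turn is rendered as
    <|im_start|>{role}\n{content}<|im_end|>\n, which is an assumed layout.
    """
    parts = []
    for message in messages:
        parts.append(
            f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = build_chatml_prompt(
        [
            {
                "role": "system",
                "content": "Provide some context and/or instructions to the model.",
            },
            {"role": "user", "content": "The user's message goes here"},
        ]
    )
    print(prompt)
```

Passing `add_generation_prompt=False` renders only the completed turns, which may be useful when formatting finished conversations rather than prompting for a new reply.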