Triangle104 committed
Commit 7234701 • 1 Parent(s): 0958ee1
Update README.md

README.md CHANGED
@@ -118,6 +118,40 @@ model-index:
 This model was converted to GGUF format from [`anthracite-org/magnum-v4-22b`](https://huggingface.co/anthracite-org/magnum-v4-22b) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/anthracite-org/magnum-v4-22b) for more details on the model.
 
+---
+
+Model details:
+-
+This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.
+
+This model is fine-tuned on top of Mistral-Small-Instruct-2409.
+
+Prompting
+-
+A typical input would look like this:
+
+<s>[INST] SYSTEM MESSAGE
+USER MESSAGE[/INST] ASSISTANT MESSAGE</s>[INST] USER MESSAGE[/INST]
+
+Credits
+-
+We'd like to thank Recursal / Featherless for sponsoring the compute for this train. Featherless has been hosting our Magnum models since the first 72B and has given thousands of people access to our models, helping us grow.
+
+We would also like to thank all members of Anthracite who made this finetune possible.
+
+Datasets
+-
+anthracite-org/c2_logs_32k_mistral-v3_v1.2_no_system
+anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system
+anthracite-org/kalo-opus-instruct-3k-filtered-no-system
+anthracite-org/nopm_claude_writing_fixed
+anthracite-org/kalo_opus_misc_240827_no_system
+anthracite-org/kalo_misc_part2_no_system
+
+Training
+-
+The training was done for 2 epochs. We used 8x H100 GPUs graciously provided by Recursal AI / Featherless AI for the full-parameter fine-tuning of the model.
+
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)
 
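As a quick illustration of the Mistral-style [INST] layout described in the Prompting section of the diff above, here is a minimal Python sketch that assembles a conversation into that shape. The `build_prompt` helper and the sample messages are illustrative assumptions, not part of this repository.

```python
# Minimal sketch of the [INST] prompt layout shown in the Prompting section.
# The build_prompt helper and the sample messages are illustrative only.

def build_prompt(system: str, turns: list[tuple[str, str]], next_user: str) -> str:
    """Assemble a system message, prior (user, assistant) turns, and the next
    user message into the <s>[INST] ... [/INST] layout described above."""
    prompt = ""
    for i, (user, assistant) in enumerate(turns):
        if i == 0:
            # The system message rides along with the first user turn.
            prompt += f"<s>[INST] {system}\n{user}[/INST] {assistant}</s>"
        else:
            prompt += f"[INST] {user}[/INST] {assistant}</s>"
    if turns:
        prompt += f"[INST] {next_user}[/INST]"
    else:
        prompt += f"<s>[INST] {system}\n{next_user}[/INST]"
    return prompt


if __name__ == "__main__":
    history = [("Write one sentence about the sea.",
                "The sea kept its own slow, grey time.")]
    print(build_prompt("You are a helpful writing assistant.", history,
                       "Now make it stormier."))
```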
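The trailing "Use with llama.cpp" lines cover the CLI route (llama.cpp installed through brew). For readers who prefer the Python bindings instead, here is a hedged sketch using llama-cpp-python; the repo id and GGUF filename below are assumptions and should be replaced with the actual quant file published alongside this commit.

```python
# Hedged sketch using the llama-cpp-python bindings rather than the brew-installed CLI.
# pip install llama-cpp-python huggingface_hub
# The repo_id and filename below are assumptions -- substitute the actual GGUF repo
# and quant file published alongside this commit.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="Triangle104/magnum-v4-22b-GGUF",  # assumption: adjust to the real repo id
    filename="*q4_k_m.gguf",                   # assumption: glob for whichever quant is present
    n_ctx=8192,                                # context window; tune for your hardware
)

prompt = ("<s>[INST] You are a helpful writing assistant.\n"
          "Write one sentence about the sea.[/INST]")
out = llm(prompt, max_tokens=128, stop=["</s>"])
print(out["choices"][0]["text"])
```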