Upload README.md
@@ -1,10 +1,15 @@
---
inference: false
license: cc-by-nc-4.0
model_creator: AIDC-ai-business
-model_link: https://huggingface.co/AIDC-ai-business/Marcoroni-7b
model_name: Marcoroni 7b
model_type: llama
quantized_by: TheBloke
---
@@ -62,11 +67,16 @@ Here is an incomplete list of clients and libraries that are known to support GGUF
<!-- repositories-available end -->

<!-- prompt-template start -->
-## Prompt template:

```
{prompt}

```

<!-- prompt-template end -->
@@ -131,7 +141,7 @@ Refer to the Provided Files table below to see what files use which methods, and
Make sure you are using `llama.cpp` from commit [d0cee0d36d5be95a0d9088b674dbb27354107221](https://github.com/ggerganov/llama.cpp/commit/d0cee0d36d5be95a0d9088b674dbb27354107221) or later.

```shell
-./main -ngl 32 -m marcoroni-7b.q4_K_M.gguf --color -c 4096 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "{prompt}"
```

Change `-ngl 32` to the number of layers to offload to GPU. Remove it if you don't have GPU acceleration.
@@ -222,5 +232,37 @@ And thank you again to a16z for their generous grant.
<!-- original-model-card start -->
# Original model card: AIDC-ai-business's Marcoroni 7b


<!-- original-model-card end -->
---
base_model: https://huggingface.co/AIDC-ai-business/Marcoroni-7b
datasets:
- Open-Orca/OpenOrca
inference: false
language:
- en
license: cc-by-nc-4.0
model_creator: AIDC-ai-business
model_name: Marcoroni 7b
model_type: llama
pipeline_tag: text-generation
quantized_by: TheBloke
---
<!-- repositories-available end -->

<!-- prompt-template start -->
## Prompt template: Alpaca

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:

```

<!-- prompt-template end -->
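For scripted use, the template above is just a fixed string wrapped around the user's request. Below is a minimal sketch in plain Python, not part of the original card; the `format_prompt` helper name and the sample instruction are my own:

```python
# Alpaca-style template copied from the prompt-template section above.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{prompt}\n\n### Response:\n"
)


def format_prompt(prompt: str) -> str:
    """Wrap a raw user request in the Alpaca template this model expects."""
    return ALPACA_TEMPLATE.format(prompt=prompt)


if __name__ == "__main__":
    print(format_prompt("Summarize what the GGUF format is in one sentence."))
```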
Make sure you are using `llama.cpp` from commit [d0cee0d36d5be95a0d9088b674dbb27354107221](https://github.com/ggerganov/llama.cpp/commit/d0cee0d36d5be95a0d9088b674dbb27354107221) or later.

```shell
./main -ngl 32 -m marcoroni-7b.q4_K_M.gguf --color -c 4096 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{prompt}\n\n### Response:"
```

Change `-ngl 32` to the number of layers to offload to GPU. Remove it if you don't have GPU acceleration.
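The same settings can also be used from Python. This is a sketch rather than part of the original card: it assumes the `llama-cpp-python` bindings are installed (`pip install llama-cpp-python`), that `marcoroni-7b.q4_K_M.gguf` has already been downloaded to the working directory, and the instruction text is a placeholder:

```python
from llama_cpp import Llama

# Mirror the ./main flags shown above: -c 4096, -ngl 32, --temp 0.7, --repeat_penalty 1.1.
llm = Llama(
    model_path="marcoroni-7b.q4_K_M.gguf",
    n_ctx=4096,        # context length (-c 4096)
    n_gpu_layers=32,   # layers offloaded to GPU (-ngl 32); use 0 without GPU acceleration
)

prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nTell me about AI.\n\n### Response:\n"
)

output = llm(prompt, max_tokens=512, temperature=0.7, repeat_penalty=1.1)
print(output["choices"][0]["text"])
```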
<!-- original-model-card start -->
# Original model card: AIDC-ai-business's Marcoroni 7b

# Marcoroni-7B
Marcoroni-7B is fine-tuned from Llama2-7B using Orca-style data and other open-source data.

# Model Details
* **Trained by**: AIDC AI-Business.
* **Model type:** **Marcoroni-7B** is an auto-regressive language model based on the Llama 2 transformer architecture.
* **Language(s)**: English
* **License for Marcoroni-7B base weights**: Non-Commercial Creative Commons license ([CC BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/))

# Prompting

## Prompt template for Alpaca style

```
### Instruction:

<prompt> (without the <>)

### Response:
```

# Evaluation Results ([Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard))

| Metric              | Value |
|---------------------|-------|
| Avg.                | 60.1  |
| ARC (25-shot)       | 58.11 |
| HellaSwag (10-shot) | 80.08 |
| MMLU (5-shot)       | 51.36 |
| TruthfulQA (0-shot) | 50.85 |

<!-- original-model-card end -->
|