Add metadata and paper link

#1
by nielsr (HF Staff) - opened
Files changed (1)
  1. README.md +7 -2
README.md CHANGED
@@ -9,6 +9,8 @@ datasets:
 - codingsteven/Llama-3-8B-chat
 language:
 - zh
+metrics:
+- accuracy
 base_model:
 - meta-llama/Llama-3.1-8B
 model-index:
@@ -71,12 +73,15 @@ model-index:
   value: 0.37368330167501296
   stderr: 0.00438421288652232
   verified: false
+pipeline_tag: text-generation
+library_name: transformers
 ---
+
 # Control-LLM-Llama3.1-8B-SynE-Concat16-Lerp
 This is a fine-tuned model of Llama-3.1-8B for muliligual-Chinese tasks on SynE dataset by Control LLM-Concat16-Lerp.

 ## Linked Paper
-This model is associated with the paper: [Control-LLM](https://arxiv.org/abs/2501.10979).
+This model is associated with the paper: [Control LLM: Controlled Evolution for Intelligence Retention in LLM](https://huggingface.co/papers/2501.10979).

 ## Evaluation Results
 Here is an overview of the evaluation results and findings:
@@ -107,4 +112,4 @@ The table below summarizes evaluation results across Chinese tasks and original
 - **MLU**: MMLU (Massive Multitask Language Understanding)
 - **MLUP**: MMLU Pro
 - **O-Avg**: Original Capability - Size Weighted Average across BBH, MLU, and MLUP
-- **Overall**: Combined average across all tasks
+- **Overall**: Combined average across all tasks
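
With `pipeline_tag: text-generation` and `library_name: transformers` set, the Hub can surface a standard Transformers loading snippet for this model. A minimal usage sketch under those assumptions; the repo id below is inferred from the model card title and may need adjusting to the actual namespace:

```python
# Minimal sketch matching the metadata added in this PR
# (library_name: transformers, pipeline_tag: text-generation).
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="ControlLLM/Control-LLM-Llama3.1-8B-SynE-Concat16-Lerp",  # assumed repo id
)

# The model targets multilingual-Chinese tasks, so a Chinese prompt is a natural test.
output = generator("请用中文介绍一下这个模型。", max_new_tokens=64)
print(output[0]["generated_text"])
```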