---
license: apache-2.0
library_name: transformers
tags:
- merge
base_model:
- 01-ai/Yi-1.5-34B-Chat
- 01-ai/Yi-1.5-34B
pipeline_tag: text-generation
model-index:
- name: YiSM-34B-0rn
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 69.54
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 86.67
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 78.51
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 59.68
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 83.66
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 75.82
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 42.84
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 45.38
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 20.62
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 16.22
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 14.76
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 41.06
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=altomek/YiSM-34B-0rn
      name: Open LLM Leaderboard
---

<img src="https://huggingface.co/altomek/YiSM-34B-0rn/resolve/main/YiSM.png">

<a href="https://www.youtube.com/watch?v=a9dNpk9G5h0" title="P.T. Adamczyk - Never Looking Back | Cyberpunk 2077: Phantom Liberty (Original Score)" target="_blank">intro music...</a>

## YiSM-34B-0rn

This is a Yi self-merge. I wanted a model that follows most instructions yet preserves its base model's nature.
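
A minimal usage sketch with transformers, assuming the merged model ships the chat template inherited from Yi-1.5-34B-Chat; the prompt and sampling values are illustrative, not prescribed:

```python
# Minimal sketch, assuming the tokenizer carries the Yi-1.5 chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "altomek/YiSM-34B-0rn"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # shard across available GPUs
)

messages = [{"role": "user", "content": "What is a model self-merge?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.1)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```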

### Ingredients

- [Yi-1.5-34B-Chat](https://huggingface.co/01-ai/Yi-1.5-34B-Chat)
- [Yi-1.5-34B](https://huggingface.co/01-ai/Yi-1.5-34B)

### Settings

I use max_seq_len 8192 with alpha_value 2.65 (NTK-aware RoPE scaling).

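For ExLlamaV2-based backends, a minimal loading sketch with these context settings might look like the following; the local model path and the choice of the 8bpw quant are assumptions for illustration:

```python
# Sketch for ExLlamaV2; the local model path is an assumption.
from exllamav2 import ExLlamaV2, ExLlamaV2Cache, ExLlamaV2Config, ExLlamaV2Tokenizer

config = ExLlamaV2Config()
config.model_dir = "models/YiSM-34B-0rn-8bpw-EXL2"
config.prepare()
config.max_seq_len = 8192        # max_seq_len from above
config.scale_alpha_value = 2.65  # alpha_value (NTK RoPE scaling)

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)
model.load_autosplit(cache)      # split weights across available GPUs
tokenizer = ExLlamaV2Tokenizer(config)
```
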
SillyTavern presets:

```json
{
    "temp": 0.1,
    "temperature_last": true,
    "top_p": 1,
    "top_k": 0,
    "top_a": 0,
    "tfs": 1,
    "epsilon_cutoff": 0,
    "eta_cutoff": 0,
    "typical_p": 1,
    "min_p": 0,
    "rep_pen": 1.08,
    "rep_pen_range": 0,
    "no_repeat_ngram_size": 0,
    "penalty_alpha": 0,
    "num_beams": 1,
    "length_penalty": 1,
    "min_length": 0,
    "encoder_rep_pen": 1,
    "freq_pen": 0.01,
    "presence_pen": 0,
    "do_sample": true,
    "early_stopping": false,
    "add_bos_token": true,
    "truncation_length": 2048,
    "ban_eos_token": false,
    "skip_special_tokens": true,
    "streaming": true,
    "mirostat_mode": 0,
    "mirostat_tau": 5,
    "mirostat_eta": 0.1,
    "guidance_scale": 1,
    "negative_prompt": "",
    "grammar_string": "",
    "banned_tokens": "",
    "ignore_eos_token_aphrodite": false,
    "spaces_between_special_tokens_aphrodite": true,
    "sampler_order": [
        6,
        0,
        1,
        3,
        4,
        2,
        5
    ],
    "logit_bias": [],
    "n": 1,
    "rep_pen_size": 0,
    "genamt": 2048,
    "max_length": 8192
}
```

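If you are not running through SillyTavern, the key values above translate roughly to transformers generation arguments. A hedged sketch, reusing `model` and `inputs` from the earlier example; fields such as tfs, min_p, and sampler_order are backend-specific and have no direct transformers equivalent:

```python
# Approximate transformers equivalent of the preset above (sketch only).
output = model.generate(
    inputs,
    do_sample=True,           # "do_sample": true
    temperature=0.1,          # "temp": 0.1
    top_p=1.0,                # "top_p": 1 (disabled)
    top_k=0,                  # "top_k": 0 (disabled)
    repetition_penalty=1.08,  # "rep_pen": 1.08
    max_new_tokens=2048,      # "genamt": 2048
)
```
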
### Terms and Conditions of Use

The following table outlines the primary characteristics and intended uses of my YiSM-34B-0rn models:

| Model Type | Purpose | Target Users | Key Features |
| --- | --- | --- | --- |
| **Censored** | Suitable for general audiences and sensitive topics | Educational institutions, families, and individuals seeking age-appropriate content | Restricts explicit or mature material |
| **Neutral** (<u>**this one**</u>) | Balances accessibility with openness | Universities, researchers, and curious minds | Encourages exploration and intellectual exchange |
| **Uncensored** | Ideal for adults and specialized fields | Professionals, experts, and advanced scholars | Offers unfiltered access to diverse viewpoints and knowledge |

Please remember that all YiSM-34B-0rn models are released under the Apache 2.0 license, so familiarize yourself with its terms before using them.

### Quants

- [GGUF](https://huggingface.co/altomek/YiSM-34B-0rn-GGUF) -> see the loading sketch below
- [8bpw](https://huggingface.co/altomek/YiSM-34B-0rn-8bpw-EXL2)
- [6.5bpw](https://huggingface.co/altomek/YiSM-34B-0rn-6.5bpw-EXL2)
- [4.65bpw](https://huggingface.co/altomek/YiSM-34B-0rn-4.65bpw-EXL2)
- [4bpw](https://huggingface.co/altomek/YiSM-34B-0rn-4bpw-EXL2)
- [3.2bpw](https://huggingface.co/altomek/YiSM-34B-0rn-3.2bpw-EXL2) -> fits in 16 GB VRAM but is not recommended; performance degrades significantly at lower quants
- [measurements](https://huggingface.co/altomek/measurements/resolve/main/YiSM-34B-0rn_measurement.json) -> ExLlamaV2 measurements

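For the GGUF files, a minimal llama-cpp-python loading sketch; the filename is an illustrative assumption, so substitute whichever quant file you actually downloaded:

```python
# Sketch using llama-cpp-python; the model filename is an assumption.
from llama_cpp import Llama

llm = Llama(
    model_path="YiSM-34B-0rn-Q4_K_M.gguf",  # illustrative quant filename
    n_ctx=8192,       # matches the 8K context used above
    n_gpu_layers=-1,  # offload all layers to GPU if available
)

result = llm("Q: What is a model self-merge? A:", max_tokens=128)
print(result["choices"][0]["text"])
```
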
### [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_altomek__YiSM-34B-0rn).

| Metric                          |Value|
|---------------------------------|----:|
|Avg.                             |75.65|
|AI2 Reasoning Challenge (25-Shot)|69.54|
|HellaSwag (10-Shot)              |86.67|
|MMLU (5-Shot)                    |78.51|
|TruthfulQA (0-shot)              |59.68|
|Winogrande (5-shot)              |83.66|
|GSM8k (5-shot)                   |75.82|

5th in the 34B size range (excluding private or deleted models), or 8th with all models included, as of 2024-06-10 ;P

<img src="https://huggingface.co/altomek/YiSM-34B-0rn/resolve/main/5thIn34B.png">

### [Open LLM Leaderboard 2 Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_altomek__YiSM-34B-0rn).

| Metric            |Value|
|-------------------|----:|
|Avg.               |30.15|
|IFEval (0-Shot)    |42.84|
|BBH (3-Shot)       |45.38|
|MATH Lvl 5 (4-Shot)|20.62|
|GPQA (0-shot)      |16.22|
|MuSR (0-shot)      |14.76|
|MMLU-PRO (5-shot)  |41.06|