|
--- |
|
license: cc-by-nc-4.0 |
|
datasets: |
|
- Open-Orca/OpenOrca |
|
language: |
|
- en |
|
--- |
|
<a href="https://www.buymeacoffee.com/PulsarAI" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a> |
|
|
|
|
|
Merge of [Marcoroni-13B](https://huggingface.co/AIDC-ai-business/Marcoroni-13B) and [Luban-13B](https://huggingface.co/AIDC-ai-business/Luban-13B) using ties merge. |
|
|
|
### *Weights* |
|
|
|
- [Marcoroni-13B](https://huggingface.co/AIDC-ai-business/Marcoroni-13B): 0.5 |
|
|
|
- [Luban-13B](https://huggingface.co/AIDC-ai-business/Luban-13B): 0.3 |
|
|
|
### *Density* |
|
|
|
- [Marcoroni-13B](https://huggingface.co/AIDC-ai-business/Marcoroni-13B): 0.5 |
|
|
|
- [Luban-13B](https://huggingface.co/AIDC-ai-business/Luban-13B): 0.5 |
|
|
|
|
|
# Evaluation Results ([Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)) |
|
|
|
| Metric | Value | |
|
|-----------------------|-------| |
|
| Avg. | 65.21 | |
|
| ARC (25-shot) | 63.65 | |
|
| HellaSwag (10-shot) | 82.92 | |
|
| MMLU (5-shot) | 58.70 | |
|
| TruthfulQA (0-shot) | 55.55 | |
|
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) |
|
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Weyaxi__Luban-Marcoroni-13B) |
|
|
|
| Metric | Value | |
|
|-----------------------|---------------------------| |
|
| Avg. | 51.16 | |
|
| ARC (25-shot) | 63.65 | |
|
| HellaSwag (10-shot) | 82.92 | |
|
| MMLU (5-shot) | 58.7 | |
|
| TruthfulQA (0-shot) | 55.55 | |
|
| Winogrande (5-shot) | 77.03 | |
|
| GSM8K (5-shot) | 10.01 | |
|
| DROP (3-shot) | 10.25 | |
|
|