--- base_model: tiiuae/Falcon3-10B-Base library_name: transformers license: other license_name: falcon-llm-license license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html tags: - falcon3 model-index: - name: Falcon3-10B-Instruct results: - task: type: text-generation name: Text Generation dataset: name: IFEval (0-Shot) type: HuggingFaceH4/ifeval args: num_few_shot: 0 metrics: - type: inst_level_strict_acc and prompt_level_strict_acc value: 78.17 name: strict accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: BBH (3-Shot) type: BBH args: num_few_shot: 3 metrics: - type: acc_norm value: 44.82 name: normalized accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MATH Lvl 5 (4-Shot) type: hendrycks/competition_math args: num_few_shot: 4 metrics: - type: exact_match value: 25.91 name: exact match source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: GPQA (0-shot) type: Idavidrein/gpqa args: num_few_shot: 0 metrics: - type: acc_norm value: 10.51 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MuSR (0-shot) type: TAUR-Lab/MuSR args: num_few_shot: 0 metrics: - type: acc_norm value: 13.61 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MMLU-PRO (5-shot) type: TIGER-Lab/MMLU-Pro config: main split: test args: num_few_shot: 5 metrics: - type: acc value: 38.1 name: accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=tiiuae/Falcon3-10B-Instruct name: Open LLM Leaderboard --- [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory) # QuantFactory/Falcon3-10B-Instruct-GGUF This is quantized version of [tiiuae/Falcon3-10B-Instruct](https://huggingface.co/tiiuae/Falcon3-10B-Instruct) created using llama.cpp # Original Model Card
Category | Benchmark | Yi-1.5-9B-Chat | Mistral-Nemo-Base-2407 (12B) | Falcon3-10B-Instruct |
---|---|---|---|---|
General | MMLU (5-shot) | 70 | 65.9 | 71.6 |
MMLU-PRO (5-shot) | 39.6 | 32.7 | 44 | |
IFEval | 57.6 | 63.4 | 78 | |
Math | GSM8K (5-shot) | 76.6 | 73.8 | 83.1 |
GSM8K (8-shot, COT) | 78.5 | 73.6 | 81.3 | |
MATH Lvl-5 (4-shot) | 8.8 | 0.4 | 22.1 | |
Reasoning | Arc Challenge (25-shot) | 51.9 | 61.6 | 64.5 |
GPQA (0-shot) | 35.4 | 33.2 | 33.5 | |
GPQA (0-shot, COT) | 16 | 12.7 | 32.6 | |
MUSR (0-shot) | 41.9 | 38.1 | 41.1 | |
BBH (3-shot) | 49.2 | 43.6 | 58.4 | |
CommonSense Understanding | PIQA (0-shot) | 76.4 | 78.2 | 78.4 |
SciQ (0-shot) | 61.7 | 76.4 | 90.4 | |
Winogrande (0-shot) | - | - | 71.3 | |
OpenbookQA (0-shot) | 43.2 | 47.4 | 48.2 | |
Instructions following | MT-Bench (avg) | 8.28 | 8.6 | 8.17 |
Alpaca (WC) | 25.81 | 45.44 | 24.7 | |
Tool use | BFCL AST (avg) | 48.4 | 74.2 | 86.3 |
Code | EvalPlus (0-shot) (avg) | 69.4 | 58.9 | 74.7 |
Multipl-E (0-shot) (avg) | - | 34.5 | 45.8 |