Arabic LLM Models

Community Article Published March 4, 2025

image/png

The ecosystem of Arabic LLM models is quickly expanding, presenting a challenge in keeping up with the latest developments. This article aims to address this issue by serving as a comprehensive resource that continuously updates with new Arabic LLM models, providing users with the necessary information and links to choose the best model for their specific task. This living document will serve as a go-to source for all your Arabic LLM model needs.

Selection Criteria

For a model to be included, one of the following needs to be true

  • Model is open source
  • Model can be tried online via a link
  • Model is offered as an API

General Purpose Models

Below is a list of general purpose Arabic Models (Order does not indicate performance)

Name Size License Link Comments
SILMA v1.0 9B Open-weight (Gemma) https://huggingface.co/silma-ai/SILMA-9B-Instruct-v1.0 Based on Gemma. Was #1 on OALL V1 benchmark
Fanar 7B Closed https://chat.fanar.qa/ Qatar's Sovereign Model
Allam 7B Open Weight (Apache 2.0) https://huggingface.co/ALLaM-AI/ALLaM-7B-Instruct-preview Saudi's Sovereign Model
Jais 590M to 70B Open Weight (Apache 2.0) https://huggingface.co/collections/inceptionai/jais-family-66add8bb9c381f5492ddb6f4 UAE's Arabic Models and one of the first players
AceGPT-7B-chat 7B-32B Open Weight (Apache 2.0) https://huggingface.co/FreedomIntelligence/AceGPT-7B-chat
Cohere command-r7b-arabic 8B Open Weight (CC Non Commercial 4.0) https://huggingface.co/CohereForAI/c4ai-command-r7b-arabic-02-2025 General purpose + optimized for RAG
Cohere aya-expanse 8B-32B Open Weight (CC Non Commercial 4.0) https://huggingface.co/CohereForAI/aya-expanse-32b
Gemma 2B-27B Open Weight (Gemma) https://huggingface.co/google/gemma-2-9b-it Google's multilingual open model which includes Arabic
Qwen 2.5 0.5B-72B Open Weight (Apache 2.0) https://huggingface.co/Qwen/Qwen2.5-0.5B Alibaba's multilingual open model which includes Arabic
Llama 3.3 70B Open Weight (Llama 3.3 Community License Agreement) https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct Meta's multilingual open model which includes Arabic. Very good performance in OALL benchmark
Llama 3.2 1B-3B Open Weight (Llama 3.3 Community License Agreement) https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct Meta's multilingual open model which includes Arabic.
Phi 3.5 4B Open Weight (MIT) https://huggingface.co/microsoft/Phi-3.5-mini-instruct Microsoft's multilingual open model which includes Arabic.
Phi 4 4B Open Weight (MIT) https://huggingface.co/microsoft/Phi-4-mini-instruct Microsoft's multilingual open model which includes Arabic.
Mistral Saba 24B Closed https://mistral.ai/news/mistral-saba Offered via API only
Ar-stablelm-2-chat 1.6B Open Weight (MIT) https://huggingface.co/stabilityai/ar-stablelm-2-chat
Yehia-7B-preview 7B Open Weight (MIT) https://huggingface.co/Navid-AI/Yehia-7B-preview Based on Allam

RAG-optimized Models

Below is a list of models trained and optimized for RAG Generatation use-cases

Name Size License Link Comments
SILMA Kashif v1.0 2B Open-weight (Gemma) https://huggingface.co/silma-ai/SILMA-Kashif-2B-Instruct-v1.0 Benchmark
Cohere command-r7b-arabic 8B Open Weight (CC Non Commercial 4.0) https://huggingface.co/CohereForAI/c4ai-command-r7b-arabic-02-2025 General purpose + optimized for RAG

Vision & OCR

Below is a list of models with multimodal capabilities (Vision, text, ...etc)

Name Size License Link Comments
AIN 8B Open Weight (MIT) https://huggingface.co/MBZUAI/AIN Based on Qwen
Qari OCR 2B Open Weight (Apache 2.0) https://huggingface.co/NAMAA-Space/Qari-OCR-0.1-VL-2B-Instruct Based on Qwen. OCR only
Cohere aya-vision 8B-32B Open Weight (CC Non Commercial 4.0) https://huggingface.co/collections/CohereForAI/c4ai-aya-vision-67c4ccd395ca064308ee1484

Dialect-optimized Models - Syrian Arabic

Models optimized for the Leventian dialect

Name Size License Link Comments
Shahin-v0.1 14B Open Weight (Apache 2.0) https://huggingface.co/malhajar/Shahin-v0.1 Based on Qwen

Dialect-optimized Models - Morocan Arabic

Models tuned for Darija, the colloquial Arabic of Morocco

Name Size License Link Comments
Atlas-Chat 9B-27B Open Weight (Gemma) https://huggingface.co/MBZUAI-Paris/Atlas-Chat-9B based on Gemma

Dialect-optimized Models - Tunisian Arabic

Models tuned for Tunisian Arabic

Name Size License Link Comments
Labess Chat 7B Open Weight (apache-2.0) https://huggingface.co/linagora/Labess-7b-chat based on Jais

A model is missing?

If you believe that a model is not included in the list, please leave a comment below. If it meets the necessary criteria, it will be added.

How to choose a model?

In addition to testing the model on real-world use cases, benchmarks are valuable for evaluating various aspects of the model's performance.

The following post contains a list of Arabic AI Benchmarks https://huggingface.co/blog/silma-ai/arabic-ai-benchmarks-and-leaderboards

Community

Hello Karim, this is very insightful and useful thank you for that!
But I know some Arabic Vision Language Model that can be added if this is helpful.

  1. Maya: MSA
  2. Palo : MSA
  3. Dallah : Dialect
  4. Peacock: MSA
  5. Pangea: MSA
·
Article author

Please share the links, also make sure it passes any of the the following criteria

Model is open source
Model can be tried online via a link
Model is offered as an API

Sign up or log in to comment