MohamedRashad
/

arabic-large-nougat

@@ -19,9 +19,17 @@ datasets:
 ## Description
 The arabic-large-nougat OCR is an end-to-end structured Optical Character Recognition (OCR) system designed specifically for the Arabic language.
-The model is based on the [facebook/nougat-small](https://huggingface.co/facebook/nougat-small) architecture and has been fine-tuned using the [Khatt dataset](https://huggingface.co/datasets/Fakhraddin/khatt) along with a custom dataset created for this purpose.
 ## How to Get Started with the Model
@@ -29,6 +37,9 @@ The model is based on the [facebook/nougat-small](https://huggingface.co/faceboo
 Or, use the code below to get started with the model locally.
 ```python
 from PIL import Image
 import torch
@@ -105,8 +116,7 @@ By selecting the GPL 3.0 license, you promote the principles of open source and
 ### Citation
-If you find this model useful, please consider citing the original facebook/nougat-base model and the datasets used for fine-tuning, including the Khatt dataset and any details regarding the custom dataset.
 ```bibtex
 @misc{rashad2024arabicnougatfinetuningvisiontransformers,
       title={Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extraction},
@@ -121,4 +131,4 @@ If you find this model useful, please consider citing the original facebook/noug
 ### Disclaimer
-The arabic-base-nougat OCR is a tool provided "as is," and the developers make no guarantees regarding its suitability for specific tasks. Users are encouraged to thoroughly evaluate the model's output for their particular use cases and requirements.

 ## Description
+<div align="center">
+<!-- **Affiliations:** -->
+[**Github**](https://github.com/MohamedAliRashad/arabic-nougat)  🤗  [**Hugging Face**](https://huggingface.co/collections/MohamedRashad/arabic-nougat-673a3f540bd92904c9b92a8e) 📝  [**Paper**](https://arxiv.org/abs/2411.17835) 🗂️  [**Data**](https://huggingface.co/datasets/MohamedRashad/arabic-img2md) 📽️  [**Demo**](https://huggingface.co/spaces/MohamedRashad/Arabic-Nougat)
+</div>
 The arabic-large-nougat OCR is an end-to-end structured Optical Character Recognition (OCR) system designed specifically for the Arabic language.
+This model was trained from scratch based on the new tokenizer [riotu-lab/Aranizer-PBE-86k](https://huggingface.co/riotu-lab/Aranizer-PBE-86k) with the base nougat architecture.
+The training happened using the [MohamedRashad/arabic-img2md](https://huggingface.co/datasets/MohamedRashad/arabic-img2md) dataset.
 ## How to Get Started with the Model
 Or, use the code below to get started with the model locally.
+Don't forget to update transformers:
+`pip install -U transformers`
 ```python
 from PIL import Image
 import torch
 ### Citation
+If you find this model useful, please cite the corresponding research paper:
 ```bibtex
 @misc{rashad2024arabicnougatfinetuningvisiontransformers,
       title={Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extraction},
 ### Disclaimer
+The arabic-large-nougat OCR is a tool provided "as is," and the developers make no guarantees regarding its suitability for specific tasks. Users are encouraged to thoroughly evaluate the model's output for their particular use cases and requirements.