Create README.md (#1)
Browse files- Create README.md (16afb43798577350161f69287071f59aada36fe3)
    	
        README.md
    ADDED
    
    | 
         @@ -0,0 +1,47 @@ 
     | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
|
| 
         | 
| 
         | 
|
| 1 | 
         
            +
            ---
         
     | 
| 2 | 
         
            +
            license: apache-2.0
         
     | 
| 3 | 
         
            +
            tags:
         
     | 
| 4 | 
         
            +
            - vision
         
     | 
| 5 | 
         
            +
            pipeline_tag: image-to-text
         
     | 
| 6 | 
         
            +
            ---
         
     | 
| 7 | 
         
            +
             
     | 
| 8 | 
         
            +
            # Nougat model, small-sized version
         
     | 
| 9 | 
         
            +
             
     | 
| 10 | 
         
            +
            Nougat model trained on PDF-to-markdown. It was introduced in the paper [Nougat: Neural Optical Understanding for Academic Documents](https://arxiv.org/abs/2308.13418) by Blecher et al. and first released in [this repository](https://github.com/facebookresearch/nougat/tree/main). 
         
     | 
| 11 | 
         
            +
             
     | 
| 12 | 
         
            +
            Disclaimer: The team releasing Nougat did not write a model card for this model so this model card has been written by the Hugging Face team.
         
     | 
| 13 | 
         
            +
             
     | 
| 14 | 
         
            +
            Note: this model corresponds to the "0.1.0-small" version of the original repository/
         
     | 
| 15 | 
         
            +
             
     | 
| 16 | 
         
            +
            ## Model description
         
     | 
| 17 | 
         
            +
             
     | 
| 18 | 
         
            +
            Nougat is a [Donut](https://huggingface.co/docs/transformers/model_doc/donut) model trained to transcribe scientific PDFs into an easy-to-use markdown format. The model consists of a Swin Transformer as vision encoder, and an mBART model as text decoder.
         
     | 
| 19 | 
         
            +
             
     | 
| 20 | 
         
            +
            The model is trained to autoregressively predict the markdown given only the pixels of the PDF image as input.
         
     | 
| 21 | 
         
            +
             
     | 
| 22 | 
         
            +
            <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/nougat_architecture.jpg"
         
     | 
| 23 | 
         
            +
            alt="drawing" width="600"/>
         
     | 
| 24 | 
         
            +
             
     | 
| 25 | 
         
            +
            <small> Nougat high-level overview. Taken from the <a href="https://arxiv.org/abs/2308.13418">original paper</a>. </small>
         
     | 
| 26 | 
         
            +
             
     | 
| 27 | 
         
            +
            ## Intended uses & limitations
         
     | 
| 28 | 
         
            +
             
     | 
| 29 | 
         
            +
            You can use the raw model for transcribing a PDF into Markdown. See the [model hub](https://huggingface.co/models?search=nougat) to look for other
         
     | 
| 30 | 
         
            +
            fine-tuned versions that may interest you.
         
     | 
| 31 | 
         
            +
             
     | 
| 32 | 
         
            +
            ### How to use
         
     | 
| 33 | 
         
            +
             
     | 
| 34 | 
         
            +
            We refer to the [docs](https://huggingface.co/docs/transformers/main/en/model_doc/nougat).
         
     | 
| 35 | 
         
            +
             
     | 
| 36 | 
         
            +
            ### BibTeX entry and citation info
         
     | 
| 37 | 
         
            +
             
     | 
| 38 | 
         
            +
            ```bibtex
         
     | 
| 39 | 
         
            +
            @misc{blecher2023nougat,
         
     | 
| 40 | 
         
            +
                  title={Nougat: Neural Optical Understanding for Academic Documents}, 
         
     | 
| 41 | 
         
            +
                  author={Lukas Blecher and Guillem Cucurull and Thomas Scialom and Robert Stojnic},
         
     | 
| 42 | 
         
            +
                  year={2023},
         
     | 
| 43 | 
         
            +
                  eprint={2308.13418},
         
     | 
| 44 | 
         
            +
                  archivePrefix={arXiv},
         
     | 
| 45 | 
         
            +
                  primaryClass={cs.LG}
         
     | 
| 46 | 
         
            +
            }
         
     | 
| 47 | 
         
            +
            ```
         
     |