### Install Git LFS
Before you begin, make sure Git Large File Storage (Git LFS) is installed on your system. Then initialize it for your user account with the following command:
```bash
git lfs install
```
### Download the Model from Hugging Face
To download the `PDF-Extract-Kit` model from Hugging Face, use the following command:
```bash
git clone https://huggingface.co/opendatalab/PDF-Extract-Kit
```
Git LFS must be enabled during the clone. Otherwise, the large files are downloaded only as small pointer files rather than the actual model weights.
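To confirm that the large files were actually fetched, one option is to scan the clone for leftover Git LFS pointer files. The following is a minimal sketch, not part of the official tooling: the helper names `is_lfs_pointer` and `find_lfs_pointers` are hypothetical, and it relies only on the fact that every LFS pointer file begins with the line `version https://git-lfs.github.com/spec/v1`.

```python
from pathlib import Path

# Every Git LFS pointer file starts with this line (pointer spec v1).
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path: Path) -> bool:
    """Return True if `path` looks like an LFS pointer instead of real content."""
    with path.open("rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def find_lfs_pointers(repo_dir: str) -> list[Path]:
    """List files under `repo_dir` that are still LFS pointers, skipping .git."""
    root = Path(repo_dir)
    return sorted(
        p
        for p in root.rglob("*")
        if p.is_file()
        and ".git" not in p.relative_to(root).parts
        and is_lfs_pointer(p)
    )
```

Calling `find_lfs_pointers("PDF-Extract-Kit")` (assuming the clone lives in a directory of that name) should return an empty list when everything was downloaded. If it lists any files, running `git lfs pull` inside the repository should fetch the missing content.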
### Download the Model from ModelScope
#### SDK Download
```bash
# First, install the ModelScope library using pip:
pip install modelscope
```
```python
# Use the following Python code to download the model using the ModelScope SDK:
from modelscope import snapshot_download
model_dir = snapshot_download('opendatalab/PDF-Extract-Kit')
```
#### Git Download
Alternatively, you can use Git to clone the model repository from ModelScope:
```bash
git clone https://www.modelscope.cn/opendatalab/PDF-Extract-Kit.git
```
Place the model files in the following directory structure:
```
./
├── Layout
│   ├── config.json
│   └── model_final.pth
├── MFD
│   └── weights.pt
├── MFR
│   └── UniMERNet
│       ├── config.json
│       ├── preprocessor_config.json
│       ├── pytorch_model.bin
│       ├── README.md
│       ├── tokenizer_config.json
│       └── tokenizer.json
├── TabRec
│   └── StructEqTable
│       ├── config.json
│       ├── generation_config.json
│       ├── model.safetensors
│       ├── preprocessor_config.json
│       ├── special_tokens_map.json
│       ├── spiece.model
│       ├── tokenizer_config.json
│       └── tokenizer.json
└── README.md
```
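A quick way to check a download against this layout is to test for each expected file. The sketch below is only an illustration: the function name `missing_model_files` is hypothetical, and the file list is copied from the directory tree above, so it needs updating if the repository layout changes.

```python
from pathlib import Path

# Relative paths copied from the directory tree above.
EXPECTED_FILES = [
    "Layout/config.json",
    "Layout/model_final.pth",
    "MFD/weights.pt",
    "MFR/UniMERNet/config.json",
    "MFR/UniMERNet/preprocessor_config.json",
    "MFR/UniMERNet/pytorch_model.bin",
    "MFR/UniMERNet/README.md",
    "MFR/UniMERNet/tokenizer_config.json",
    "MFR/UniMERNet/tokenizer.json",
    "TabRec/StructEqTable/config.json",
    "TabRec/StructEqTable/generation_config.json",
    "TabRec/StructEqTable/model.safetensors",
    "TabRec/StructEqTable/preprocessor_config.json",
    "TabRec/StructEqTable/special_tokens_map.json",
    "TabRec/StructEqTable/spiece.model",
    "TabRec/StructEqTable/tokenizer_config.json",
    "TabRec/StructEqTable/tokenizer.json",
    "README.md",
]


def missing_model_files(model_root: str) -> list[str]:
    """Return the expected relative paths that do not exist under `model_root`."""
    root = Path(model_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
```

For example, `missing_model_files(".")` run from the model directory should return an empty list when the layout is complete, and otherwise the relative paths that still need to be downloaded.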