
	---
	license: openrail
	---


	<h3 align="center">PDF Document Layout Analysis</h3>
	<p align="center">Models for extracting segments, along with their types, from a PDF</p>

	This model card provides the non-visual models used in our pdf-document-layout-analysis service:

	https://github.com/huridocs/pdf-document-layout-analysis

	This service segments PDF pages and classifies each segment, identifying elements such as text, titles, pictures, and tables. It also determines the correct reading order of the identified elements.


	## Quick Start

	Clone the service:

	git clone https://github.com/huridocs/pdf-document-layout-analysis.git
	cd pdf-document-layout-analysis

	Start the service:

	# With GPU support:
	make start

	# Without GPU support [if you do not have a GPU on your system]
	make start_no_gpu


	Get the segments of a PDF:

	# With visual models
	curl -X POST -F 'file=@/PATH/TO/PDF/pdf_name.pdf' localhost:5060

	# With non-visual models [with the models in this model card]
	curl -X POST -F 'file=@/PATH/TO/PDF/pdf_name.pdf' localhost:5060/fast
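
The same request can be sent from a script. The following is a minimal sketch using only the Python standard library, not part of the service itself. The helper names `build_multipart` and `analyze_pdf` are hypothetical, and the sketch assumes the service is running locally on port 5060 and returns its result as JSON.

```python
from __future__ import annotations

import json
import urllib.request
import uuid
from pathlib import Path


def build_multipart(field: str, filename: str, payload: bytes) -> tuple[bytes, str]:
    """Encode one file as multipart/form-data and return (body, content_type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def analyze_pdf(pdf_path: str, fast: bool = True,
                host: str = "http://localhost:5060") -> list[dict]:
    """POST a PDF to the service (hypothetical helper).

    fast=True targets the /fast endpoint (non-visual models);
    fast=False targets the root endpoint (visual models).
    Assumption: the response body is JSON.
    """
    url = f"{host}/fast" if fast else host
    path = Path(pdf_path)
    # The form field is named "file", matching the curl commands.
    body, content_type = build_multipart("file", path.name, path.read_bytes())
    request = urllib.request.Request(
        url, data=body, method="POST", headers={"Content-Type": content_type}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the service started, `analyze_pdf("/PATH/TO/PDF/pdf_name.pdf")` mirrors the second curl command, and passing `fast=False` mirrors the first.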


	When processing is done, the output includes a list of SegmentBox elements. Each SegmentBox element has the following information:

	{
	"left": Left position of the segment
	"top": Top position of the segment
	"width": Width of the segment
	"height": Height of the segment
	"page_number": Page number which the segment belongs to
	"text": Text inside the segment
	"type": Type of the segment
	}
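
To work with these fields in code, the response can be loaded into a small typed structure. This is a sketch under stated assumptions: the `SegmentBox` dataclass here only mirrors the field names listed above and is not the service's own class, `parse_segments` and `text_by_page` are hypothetical helpers, and the type labels in the sample data (`"Title"`, `"Text"`) are illustrative placeholders, not confirmed values.

```python
from __future__ import annotations

from collections import defaultdict
from dataclasses import dataclass, fields


@dataclass
class SegmentBox:
    """Mirrors the fields listed above (illustrative, not the service's class)."""
    left: float
    top: float
    width: float
    height: float
    page_number: int
    text: str
    type: str


def parse_segments(raw: list[dict]) -> list[SegmentBox]:
    """Build SegmentBox objects, ignoring any keys not listed above."""
    names = [f.name for f in fields(SegmentBox)]
    return [SegmentBox(**{name: item[name] for name in names}) for item in raw]


def text_by_page(segments: list[SegmentBox],
                 wanted_type: str | None = None) -> dict[int, list[str]]:
    """Group segment text by page, keeping the order of the input list."""
    pages: dict[int, list[str]] = defaultdict(list)
    for segment in segments:
        if wanted_type is None or segment.type == wanted_type:
            pages[segment.page_number].append(segment.text)
    return dict(pages)


# Made-up sample data; the type labels are assumptions for illustration.
sample = [
    {"left": 10, "top": 20, "width": 100, "height": 12,
     "page_number": 1, "text": "Introduction", "type": "Title"},
    {"left": 10, "top": 40, "width": 100, "height": 50,
     "page_number": 1, "text": "First paragraph.", "type": "Text"},
]
segments = parse_segments(sample)
titles = text_by_page(segments, wanted_type="Title")
```

Because the helpers preserve the order of the list they receive, the element order determined by the service carries through to the grouped output.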


	To stop the server:

	make stop


	For more information, see:

	https://github.com/huridocs/pdf-document-layout-analysis