curiousily
commited on
Commit
•
733368c
1
Parent(s):
e08eaaa
Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
license:
|
3 |
language:
|
4 |
- en
|
5 |
library_name: transformers
|
@@ -13,6 +13,14 @@ metrics:
|
|
13 |
|
14 |
This model is a fine-tuned version of [microsoft/layoutlmv3-base](https://huggingface.co/microsoft/layoutlmv3-base) trained on [Financial Documents Clustering Kaggle Dataset](https://www.kaggle.com/datasets/drcrabkg/financial-statements-clustering).
|
15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
## Training
|
17 |
|
18 |
This model uses OCR data from [EasyOCR](https://github.com/JaidedAI/EasyOCR) instead of the default Tesseract OCR engine.
|
|
|
1 |
---
|
2 |
+
license: cc-by-nc-sa-4.0
|
3 |
language:
|
4 |
- en
|
5 |
library_name: transformers
|
|
|
13 |
|
14 |
This model is a fine-tuned version of [microsoft/layoutlmv3-base](https://huggingface.co/microsoft/layoutlmv3-base) trained on [Financial Documents Clustering Kaggle Dataset](https://www.kaggle.com/datasets/drcrabkg/financial-statements-clustering).
|
15 |
|
16 |
+
It classifies document images into one of the following (5) classes:
|
17 |
+
|
18 |
+
- Income Statements
|
19 |
+
- Balance Sheets
|
20 |
+
- Cash Flows
|
21 |
+
- Notes
|
22 |
+
- Others
|
23 |
+
|
24 |
## Training
|
25 |
|
26 |
This model uses OCR data from [EasyOCR](https://github.com/JaidedAI/EasyOCR) instead of the default Tesseract OCR engine.
|