Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
3bbf593
document_redaction
/
tools
Ctrl+K
Ctrl+K
3 contributors
History:
124 commits
seanpedrickcase
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
3bbf593
about 1 month ago
__init__.py
Safe
0 Bytes
Initial commit
about 1 year ago
auth.py
Safe
2.46 kB
Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes
about 2 months ago
aws_functions.py
Safe
9.47 kB
Improved logging format a little. Now possible to save logs to DynamoDB
about 2 months ago
aws_textract.py
Safe
27.3 kB
Now local OCR outputs can be saved to file and reloaded to save preparation time. Bug fixing in logs and tabular data redaction. Update to documentation
about 2 months ago
cli_redact.py
Safe
4.74 kB
More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements
3 months ago
config.py
14.3 kB
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
about 1 month ago
custom_csvlogger.py
Safe
12.8 kB
Updated logging format for timestamps to be compatible with AWS. Added load_dynamo_logs.py example file.
about 2 months ago
custom_image_analyser_engine.py
Safe
53.9 kB
Now local OCR outputs can be saved to file and reloaded to save preparation time. Bug fixing in logs and tabular data redaction. Update to documentation
about 2 months ago
data_anonymise.py
Safe
35.9 kB
Now local OCR outputs can be saved to file and reloaded to save preparation time. Bug fixing in logs and tabular data redaction. Update to documentation
about 2 months ago
file_conversion.py
Safe
100 kB
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
about 1 month ago
file_redaction.py
119 kB
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
about 1 month ago
find_duplicate_pages.py
Safe
9.87 kB
Corrected a couple of bugs. Now Textract whole document API call outputs will load also the input PDF into the app
about 1 month ago
helper_functions.py
Safe
26.3 kB
Now local OCR outputs can be saved to file and reloaded to save preparation time. Bug fixing in logs and tabular data redaction. Update to documentation
about 2 months ago
load_spacy_model_custom_recognisers.py
Safe
13.7 kB
Major update. General code revision. Improved config variables. Dataframe based review frame now includes text, items can be searched and excluded. Costs now estimated. Option for adding cost codes added. Option to extract text only.
2 months ago
presidio_analyzer_custom.py
Safe
4.92 kB
More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements
3 months ago
redaction_review.py
Safe
81 kB
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
about 1 month ago
textract_batch_call.py
27.9 kB
Added config options for compressing output pdfs, returning output redacted pdfs at all, and for changing the length of time for showing previous Textract jobs
about 1 month ago