Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
4
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
document_redaction
/
tools
3 contributors
History:
85 commits
seanpedrickcase
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
bde6e5b
14 days ago
__init__.py
Safe
0 Bytes
Initial commit
10 months ago
auth.py
Safe
2.92 kB
Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update.
20 days ago
aws_functions.py
Safe
7.37 kB
Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page
about 2 months ago
aws_textract.py
Safe
11.8 kB
Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels.
28 days ago
cli_redact.py
Safe
4.73 kB
Allowed for overwriting of default output folder in choose_and_run_redactor function.
3 months ago
custom_csvlogger.py
Safe
6.65 kB
Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks
3 months ago
custom_image_analyser_engine.py
Safe
49.2 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
data_anonymise.py
Safe
20.9 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
file_conversion.py
Safe
36.6 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
file_redaction.py
Safe
94.4 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
find_duplicate_pages.py
Safe
9.63 kB
Added tab to be able to compare pages across multiple documents and redact duplicates
18 days ago
helper_functions.py
Safe
14 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
load_spacy_model_custom_recognisers.py
Safe
13.7 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago
presidio_analyzer_custom.py
Safe
4.94 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
3 months ago
redaction_review.py
Safe
28.1 kB
Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text.
14 days ago