streamlit fpdf PyPDF2 pytesseract pdf2image transformers torch scikit-learn