streamlit groq scikit-learn numpy pandas transformers torch PyPDF2 python-docx