transformers torch scikit-learn openai python-docx astroquery pyvo pandas faiss-cpu PyMuPDF PyPDF2 numpy