pdf2image pypdf tiktoken langchain langchain-community langchain-huggingface chromadb InstructorEmbedding huggingface_hub==0.25.2