streamlit torch transformers accelerate sentencepiece tiktoken einops pytest flash-attention