requests
gradio
datasets
scikit-learn
huggingface_hub
fpdf
torch
transformers
spacy
nltk
tiktoken
blobfile
sentencepiece
tokenizer
libgen-api