beautifulsoup4
selenium
scikit-learn
transformers
langdetect
torch
numpy
webdriver-manager
lxml
requests