Hey all. I'm testing an application, Videomatch (a Hugging Face Space by Iskaj), and I've run into a discrepancy between running the Space locally and running it on Spaces.
Running this code:
import shutil
import urllib.request
url = "https://rr2---sn-5hne6nzk.googlevideo.com/videoplayback?expire=1665421952&ei=IP5DY7mjMYi71wLCu6XgCA&ip=,24007246&c=WEB&txp=4438434&n=eIEl-8KvnxIFRjF&sparams=expire%2Cei%2Cip%2Cid%2Citag%2Csource%2Crequiressl%2Cvprv%2Cmime%2Cns%2Cgir%2Cclen%2Cratebypass%2Cdur%2Clmt&sig=AOq0QJ8wRgIhAO7A4SkbzFwmQ2N6wBKGGfoBPhkX-8hQ0EZXngc-V4-2AiEA1zcKldSWoOvUD5S4VvTxC7kn9Nbo7DX4LiqPO0-RgOI%3D&redirect_counter=1&rm=sn-5hnesd7l&req_id=cb4d5150b17ea3ee&cms_redirect=yes&cmsv=e&ipbypass=yes&mh=6Q&mip=,mh,mip,mm,mn,ms,mv,mvi,pl&lsig=AG3C_xAwRgIhAPY0LyNdQ-k5EjKC2V9m1gRhOhQ6KxCl_9nhJeSNjmgGAiEA5gdstdSbaXsHmgpybV7LM3n6Brke5GH0xtR1FKRbOO4%3D"
# filepath is the destination path on disk; it is not shown in this snippet
req = urllib.request.Request(
    url,
    headers={'User-Agent': 'Mozilla/5.0'},
)
with urllib.request.urlopen(req, timeout=300) as f, open(filepath, 'wb') as fileout:
    shutil.copyfileobj(f, fileout, length=16*1024)
This gives back a 403 (access denied) error, but ONLY when running on HF Spaces, not when running locally. I already tried things like including the header {'User-Agent': 'Mozilla/5.0'}, because I thought the server might be blocking scraping or something.
Anyone know why this discrepancy occurs between running locally and running via Spaces?
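To compare the two environments, it may help to log the status code and response headers instead of letting the exception propagate. The sketch below is only a diagnostic aid: `describe_signed_url` and `probe` are hypothetical helper names, not part of the Space. It also pulls out the `expire` and `ip` query parameters that appear in the URL above. Whether the server actually ties the link to the requesting IP is an assumption here, not something confirmed in this thread.

```python
import urllib.error
import urllib.request
from urllib.parse import parse_qs, urlsplit


def describe_signed_url(url):
    """Return the 'expire' and 'ip' query parameters of a URL, if present."""
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    return {key: query[key][0] for key in ("expire", "ip") if key in query}


def probe(url, timeout=30):
    """Return (status, headers) for a GET request without raising on HTTP errors."""
    req = urllib.request.Request(url, headers={'User-Agent': 'Mozilla/5.0'})
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status, dict(resp.headers)
    except urllib.error.HTTPError as err:
        # HTTPError carries the status code and the response headers
        return err.code, dict(err.headers)
```

Running `probe(url)` both locally and inside the Space, and comparing the two results with the output of `describe_signed_url(url)`, should show whether the same link is being treated differently depending on where the request comes from.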
From the Python documentation for urllib.request.Request: method should be a string that indicates the HTTP request method that will be used (e.g. 'HEAD'). If provided, its value is stored in the method attribute and is used by get_method(). The default is 'GET' if data is None or 'POST' otherwise. Subclasses may indicate a different default method by setting the method attribute in the class itself.
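A small illustration of the behaviour described in that excerpt, using only the standard library. The URL is a placeholder and no request is actually sent; the snippet only inspects which method each `Request` object would use.

```python
import urllib.request

url = "https://example.com/video.mp4"  # placeholder URL, not from the thread

head_req = urllib.request.Request(url, method='HEAD')     # explicit method
get_req = urllib.request.Request(url)                     # no data, so GET
post_req = urllib.request.Request(url, data=b'payload')   # data given, so POST

print(head_req.get_method())  # HEAD
print(get_req.get_method())   # GET
print(post_req.get_method())  # POST
```

The snippet in the original post passes no data and no method, so it sends a plain GET.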
But I did manage to solve it for my use case by downloading the video (which is a YouTube video) directly with pytube instead of via the provided link. Posting this just in case someone out there is trying the same thing and hits this issue.
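For anyone who wants a starting point, here is a minimal sketch of what a pytube download could look like. It is not the exact code used in the Space: `download_youtube_video` and its parameters are hypothetical names, and it assumes pytube is installed and that you have the regular YouTube watch URL of the video.

```python
def download_youtube_video(watch_url, output_dir=".", filename=None):
    """Download a YouTube video with pytube and return the saved file path."""
    # Imported here so this sketch can be loaded even if pytube is missing
    from pytube import YouTube

    # Pick the highest-resolution progressive stream (video and audio together)
    stream = YouTube(watch_url).streams.get_highest_resolution()
    return stream.download(output_path=output_dir, filename=filename)
```

Because pytube resolves the stream from inside the environment that runs the code, this avoids reusing a direct link that was generated somewhere else.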