New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add new __NEXT_DATA__ parsing for videos on BBC #9701
Comments
IIRC, the nextjs data is only present when from requesting the webpages from a US IP address. If you are going to open a PR, make sure that none of the current extraction strategies are removed/replaced -- this should only be done in addition to them. And use the |
Indeed, no As BBC content shown outside UK can be ad-supported, there may be a different front-end to the asset store, perhaps even developed separately from the one seen at home. It might be helpful for OP to post an example of the |
DO NOT REMOVE OR SKIP THE ISSUE TEMPLATE
Checklist
Region
United States
Example URLs
https://www.bbc.com/news/uk-68546268
https://www.bbc.com/news/world-middle-east-68778149
https://www.bbc.com/reel/video/p07c6sb6/how-positive-thinking-is-harming-your-happiness
Provide a description that is worded well enough to be understood
BBC.py needs a new clause to capture video from some websites that have video data within a structure called NEXT_DATA in the website data. The current regex/json parsers do not find these videos. In the example, the code fails after the last regex parser fails to locate the link.
I have a fix for this which I will add which captures the new tag from the web data that looks like
<script id="__NEXT_DATA__" type="application/json">
using an added regex parser. From within this, all the parameters for a video are found.Provide verbose output that clearly demonstrates the problem
yt-dlp -vU <your command line>
)'verbose': True
toYoutubeDL
params instead[debug] Command-line config
) and insert it belowComplete Verbose Output
The text was updated successfully, but these errors were encountered: