[extractor/tele5] Modified tele5 extractor to fix Issue #8501 #9792

JerryZhouSirui · 2024-04-26T17:53:05Z

IMPORTANT: PRs without the template will be CLOSED

Description of your pull request and other information

ADD DESCRIPTION HERE

Fixes #

Template

Before submitting a pull request make sure you have:

At least skimmed through contributing guidelines including yt-dlp coding conventions
Searched the bugtracker for similar pull requests
Checked the code with flake8 and ran relevant tests

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Fix or improvement to an extractor (Make sure to add/update tests)
New extractor (Piracy websites will not be accepted)
Core bug fix/improvement
New feature (It is strongly recommended to open an issue first)

pukkandan

partial review b4 I realized the PR is fundamentally broken

pukkandan · 2024-04-26T18:48:18Z

yt_dlp/extractor/beatport.py

+ try:
+ playables_json = self._search_regex(
+ r'window\.Playables\s*=\s*({.+?})\s*;', webpage,
+ 'playables info', default='{}', flags=re.DOTALL)
+ playables = self._parse_json(playables_json, track_id)
+ except re.error:
+ raise ExtractorError('Failed to extract playables information. The page structure may have changed.')


We might as well convert this to self._search_json since we are editing it

pukkandan · 2024-04-26T18:54:53Z

yt_dlp/extractor/beatport.py

+ if not playables or 'tracks' not in playables:
+ raise ExtractorError('No playable tracks found in the extracted information.')

- title = ', '.join((a['name'] for a in track['artists'])) + ' - ' + track['name']
- if track['mix']:
+ track = next((t for t in playables['tracks'] if t['id'] == int(track_id)), None)
+ if not track:
+ raise ExtractorError(f'No track with ID {track_id} found.')
+
+ title = ', '.join(a['name'] for a in track['artists']) + ' - ' + track['name']
+ if track.get('mix'):
 title += ' (' + track['mix'] + ')'


Suggested change

if not playables or 'tracks' not in playables:

raise ExtractorError('No playable tracks found in the extracted information.')

title = ', '.join((a['name'] for a in track['artists'])) + ' - ' + track['name']

if track['mix']:

track = next((t for t in playables['tracks'] if t['id'] == int(track_id)), None)

if not track:

raise ExtractorError(f'No track with ID {track_id} found.')

title = ', '.join(a['name'] for a in track['artists']) + ' - ' + track['name']

if track.get('mix'):

title += ' (' + track['mix'] + ')'

track = traverse_obj(playables, ('tracks', lambda _, t: t['id'] == int(track_id), {dict}))

if not track:

raise ExtractorError(f'No track with ID {track_id} found')

title = join_nonempty(

', '.join(traverse_obj(track, ('artists', ..., 'name'))),

track.get('name'), format_field(track, 'mix', '(%s)'))

pukkandan · 2024-04-26T18:56:19Z

supportedsites.md

@@ -503,6 +503,7 @@
 - **gem.cbc.ca**: [*cbcgem*](## "netrc machine")
 - **gem.cbc.ca:live**
 - **gem.cbc.ca:playlist**
+ - **generic**: Generic downloader that works on some sites


pukkandan · 2024-04-26T18:56:47Z

yt_dlp/extractor/beatport.py

 return {
- 'id': compat_str(track.get('id')) or track_id,
- 'display_id': track.get('slug') or display_id,
+ 'id': compat_str(track.get('id', track_id)),
+ 'display_id': track.get('slug', display_id),
 'title': title,
 'formats': formats,
- 'thumbnails': images,
- }
+ 'thumbnails': images
+ }


pukkandan · 2024-04-26T19:00:09Z

yt_dlp/extractor/tele5.py

@@ -1,17 +1,68 @@
+import re
+
+import requests


Do not use requests. All network access should go through the helper functions

pukkandan · 2024-04-26T19:00:41Z

yt_dlp/extractor/tele5.py

- if getattr(e, 'message', '') == 'Missing deviceId in context':
- self.report_drm(video_id)
- raise
+ content_regex = re.compile(r'https?://(?:www\.)?(?P<environment>[^.]+)\.de/(?P<parent_slug>[^/]+)/(?P<slug>[^/?#&]+)')


self._search_regex

pukkandan · 2024-04-26T19:00:54Z

yt_dlp/extractor/tele5.py

+ referer=url,
+ url='https://de-api.loma-cms.com/feloma/configurations/?environment={0}'.format(environment))
+
+ site_info = cached_base.get('data').get('settings').get('site')


traverse_obj

pukkandan · 2024-04-26T19:04:43Z

This is just a copy of #8501 with unrelated beatport changes added on top??? Pls explain

Ignore above review until this is resolved

JerryZhouSirui added 2 commits April 26, 2024 13:48

Modified tele5 extractor

7565fcb

Fix beatport extractor

a4d4809

pukkandan requested changes Apr 26, 2024

View reviewed changes

pukkandan reviewed Apr 26, 2024

View reviewed changes

pukkandan requested changes Apr 26, 2024

View reviewed changes

pukkandan added site-bug Issue with a specific website pending-fixes PR has had changes requested labels Apr 26, 2024

pukkandan added the invalid This doesn't seem right label Apr 26, 2024

pukkandan mentioned this pull request Apr 27, 2024

[Extractor | Tele5] Fix Tele5 Extractor #9796

Closed

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[extractor/tele5] Modified tele5 extractor to fix Issue #8501 #9792

[extractor/tele5] Modified tele5 extractor to fix Issue #8501 #9792

JerryZhouSirui commented Apr 26, 2024

pukkandan left a comment •

edited

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan Apr 26, 2024

pukkandan commented Apr 26, 2024 •

edited

[extractor/tele5] Modified tele5 extractor to fix Issue #8501 #9792

Are you sure you want to change the base?

[extractor/tele5] Modified tele5 extractor to fix Issue #8501 #9792

Conversation

JerryZhouSirui commented Apr 26, 2024

Description of your pull request and other information

Before submitting a pull request make sure you have:

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

What is the purpose of your pull request?

pukkandan left a comment • edited

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan Apr 26, 2024

Choose a reason for hiding this comment

pukkandan commented Apr 26, 2024 • edited

pukkandan left a comment •

edited

pukkandan commented Apr 26, 2024 •

edited