[networking] Add response extensions for impersonate info #9756

bashonly · 2024-04-21T18:17:42Z

this can be useful during extractor error handling, e.g. in the crunchyroll extractor, where we want to expect the 403 error and a offer a cloudflare bypass hint if impersonation was not used, or else raise w/ a bug report message if impersonation was used.

Template

Before submitting a pull request make sure you have:

At least skimmed through contributing guidelines including yt-dlp coding conventions
Searched the bugtracker for similar pull requests
Checked the code with flake8 and ran relevant tests

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

I am the original author of this code and I am willing to release it under Unlicense

What is the purpose of your pull request?

Core bug fix/improvement

Authored by: bashonly

pukkandan · 2024-04-23T13:45:10Z

Pls explain motivation for 3b48105. I liked the first impl much better

bashonly · 2024-04-23T14:46:03Z

@pukkandan

hypothetically, the value of impersonate can be None:

yt-dlp/yt_dlp/networking/impersonate.py

Lines 61 to 80 in 52f5be1

 class ImpersonateRequestHandler(RequestHandler, ABC): 

 """ 

  Base class for request handlers that support browser impersonation. 

  This provides a method for checking the validity of the impersonate extension, 

  which can be used in _check_extensions. 

  Impersonate targets consist of a client, version, os and os_ver. 

  See the ImpersonateTarget class for more details. 

  The following may be defined: 

  - `_SUPPORTED_IMPERSONATE_TARGET_MAP`: a dict mapping supported targets to custom object. 

  Any Request with an impersonate target not in this list will raise an UnsupportedRequest. 

  Set to None to disable this check. 

  Note: Entries are in order of preference 

  Parameters: 

  @param impersonate: the default impersonate target to use for requests. 

  Set to None to disable impersonation. 

  """

so if a response is an instance of ImpersonateResponse, then that is completely meaningless on its own. In order to determine if impersonation was used for a request, the extractor would also have to do check truthiness of the impersonate attribute, with the code looking like this:

from ..networking.impersonate import ImpersonateResponse

try:
    ...
except ExtractorError as e:
    if isinstance(e.cause, HTTPError) and e.cause.status == 403:
        if isinstance(e.cause.response, ImpersonateResponse) and e.cause.response.impersonate is not None:
            target = str(e.cause.response.impersonate)
            # handling for impersonation
        # handling for no impersonation

vs. with new impl:

try:
    ...
except ExtractorError as e:
    if isinstance(e.cause, HTTPError) and e.cause.status == 403:
        if target := e.cause.response.extras.get('impersonate'):
            # handling for impersonation
        # handling for no impersonation

pukkandan · 2024-04-23T14:50:21Z

Why not target := getattr(e.cause.response, 'impersonate', None)?

Alternatively, we could put an impersonate = None in normal Response if we wanna be able to check e.cause.response.impersonate safely

Authored by: bashonly

bashonly · 2024-04-23T19:51:14Z

Why not target := getattr(e.cause.response, 'impersonate', None)?

That should suffice, yeah. I've reverted to the orig impl (but kept the corrected type annotations)

Alternatively, we could put an impersonate = None in normal Response if we wanna be able to check e.cause.response.impersonate safely

IMO the base Response shouldn't concern itself with the extensions of its subclasses

Authored by: bashonly

yt_dlp/networking/common.py

Authored by: bashonly

[networking] Add ImpersonateResponse

64d4c4b

Authored by: bashonly

bashonly added enhancement New feature or request networking core networking related labels Apr 21, 2024

bashonly requested a review from coletdjnz April 21, 2024 18:17

bashonly mentioned this pull request Apr 21, 2024

[ie/crunchyroll] Fix auth and remove cookies support #9749

Merged

5 tasks

bashonly added 3 commits April 22, 2024 17:41

slight impl change

3b48105

Authored by: bashonly

Merge branch 'yt-dlp:master' into feat/impersonate-response

d66fa6a

fix test

f2f108b

Authored by: bashonly

revert to orig impl

d5d8339

Authored by: bashonly

bashonly added 3 commits April 23, 2024 18:04

new impl

b77ccbc

Authored by: bashonly

add tests

805f40a

Authored by: bashonly

oops

6c7bfb5

Authored by: bashonly

bashonly changed the title ~~[networking] Add ImpersonateResponse~~ [networking] Add response extensions for impersonate info Apr 23, 2024

coletdjnz approved these changes Apr 25, 2024

View reviewed changes

yt_dlp/networking/common.py Outdated Show resolved Hide resolved

bashonly added 3 commits May 4, 2024 11:29

Expose extensions as param to Response, document

0b95a4f

Authored by: bashonly

tiny refactor

ff84e9e

Authored by: bashonly

punctuation

62c8c39

Authored by: bashonly

Grub4K approved these changes May 4, 2024

View reviewed changes

bashonly merged commit bec9a59 into yt-dlp:master May 4, 2024
15 checks passed

bashonly deleted the feat/impersonate-response branch May 10, 2024 19:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[networking] Add response extensions for impersonate info #9756

[networking] Add response extensions for impersonate info #9756

bashonly commented Apr 21, 2024 •

edited

pukkandan commented Apr 23, 2024

bashonly commented Apr 23, 2024

pukkandan commented Apr 23, 2024 •

edited

bashonly commented Apr 23, 2024

[networking] Add response extensions for impersonate info #9756

[networking] Add response extensions for impersonate info #9756

Conversation

bashonly commented Apr 21, 2024 • edited

Before submitting a pull request make sure you have:

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

What is the purpose of your pull request?

pukkandan commented Apr 23, 2024

bashonly commented Apr 23, 2024

pukkandan commented Apr 23, 2024 • edited

bashonly commented Apr 23, 2024

bashonly commented Apr 21, 2024 •

edited

pukkandan commented Apr 23, 2024 •

edited