Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

realitybloger.wordpress.com fails to be crawled #442

Open
benoit74 opened this issue Dec 9, 2024 · 0 comments
Open

realitybloger.wordpress.com fails to be crawled #442

benoit74 opened this issue Dec 9, 2024 · 0 comments
Labels
bug scraping_issue Issue occured while using the scraper
Milestone

Comments

@benoit74
Copy link
Collaborator

benoit74 commented Dec 9, 2024

Someone insisted quite a lot on this website.

It always fails on seed page with strange timeouts while doing link extraction and looking for page title

@benoit74 benoit74 added the bug label Dec 9, 2024
@benoit74 benoit74 added the scraping_issue Issue occured while using the scraper label Mar 10, 2025
@benoit74 benoit74 added this to the later milestone Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug scraping_issue Issue occured while using the scraper
Projects
None yet
Development

No branches or pull requests

1 participant