Skip to content

The scraper runs but it returns an empty file #211

Answered by dcaribou
HaiFred asked this question in FAQ
Discussion options

You must be logged in to vote

Hey @HaiFred,

I noticed the following two lines in the log you provided

...
'downloader/request_count': 241
...
'httpcache/hit': 241,

This is suggesting that the crawler is indeed working since 241 requests is roughly what is to be expected for a run of the players crawler (I get about the same numbers when I run it from my local).

However the crawler is using your local cache to serve responses, and this could be why you are not getting any output. Scrapy may save responses to cache regardless of the return status, and it happened to me sometimes that I ended up with a corrupted cache because of this.

Can you try and run it with the cache deactivated? You can do so by changing the settin…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by dcaribou
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
FAQ
2 participants
Converted from issue

This discussion was converted from issue #184 on September 24, 2023 15:44.