Skip to content
This repository has been archived by the owner on May 5, 2020. It is now read-only.

Different results based on allowed file formats #30

Open
siccovansas opened this issue Feb 20, 2015 · 3 comments
Open

Different results based on allowed file formats #30

siccovansas opened this issue Feb 20, 2015 · 3 comments
Labels

Comments

@siccovansas
Copy link

When I search rijksoverheid.nl (the main website of the Dutch government) I retrieve 384 files (60 xlsx, 323 xls, 1 unknown/xlsm) with the default set of allowed file formats (as specified in constants.py). When I add 'ods' to the list of allowed file formats and retry rijksoverheid.nl then I only retrieve 298 files (24 xlsx, 149 xls, 124 ods, 1 unknown/xlsm). I have tested both versions 3 times and the results were always like this.

So, how can it be that I retrieve less files when I add a file format?

@siccovansas siccovansas changed the title Different results base on allowed file formats Different results based on allowed file formats Feb 20, 2015
@waldoj
Copy link
Member

waldoj commented Feb 21, 2015

You might try running an equivalent search on Bing, searching for a series of filetypes—once with ODS, once without—and see if you can replicate the results. (This service is powered by Bing's API.) My suspicion is that this problem is on Bing's end.

@waldoj
Copy link
Member

waldoj commented Feb 21, 2015

I just ran that search on Bing, and I get 635 results with ODS, and 521 results without. (Search engines being what they are, it would not surprise me to find out that we're being shown different results for the same queries.) So that would seem to point to a problem with LMGTDFY, and not a problem with Bing.

@waldoj waldoj added the bug label Feb 21, 2015
@siccovansas
Copy link
Author

siccovansas commented Feb 21, 2015 via email

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants