Add Amazon #44

Open
MeNsaaH opened this issue Sep 22, 2019 · 6 comments

Comments

@MeNsaaH
Member

MeNsaaH commented Sep 22, 2019

No description provided.

@MeNsaaH MeNsaaH added the enhancement New feature or request label Sep 22, 2019
@kaustavbhattacharya07

Hello! I am interested in adding this enhancement. Is the requirement to extract the top ten results for a particular search? Could you clarify what is needed?

@MeNsaaH
Member Author

MeNsaaH commented Sep 30, 2019

Yeah. Given a search on Amazon, it should return the title, description, link, price and rating for each result. Check out the Contribution guide for more details on contributing.
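
To make the requested fields concrete, here is a minimal sketch of what one result record could look like. The class name `ProductResult` and its field names are hypothetical, chosen only for illustration; they are not taken from the project's code or from the Contribution guide.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ProductResult:
    """One Amazon search hit (hypothetical shape, not the project's API)."""

    title: str
    description: str
    link: str
    # Price and rating are optional because a listing may omit either one.
    price: Optional[str] = None     # kept as text, e.g. "$12.99"
    rating: Optional[float] = None  # e.g. 4.5 (out of 5)


# Example record with made-up values.
example = ProductResult(
    title="USB-C cable",
    description="1 m braided cable",
    link="https://www.amazon.com/dp/EXAMPLE",
    price="$12.99",
    rating=4.5,
)

# asdict() turns the record into a plain dict, handy for returning results.
example_dict = asdict(example)
```

Keeping the price as text avoids guessing at currency formats; a parser could convert it later if needed.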

@deven96 deven96 added help wanted Extra attention is needed in progress labels Sep 30, 2019
@devajithvs
Contributor

devajithvs commented Apr 22, 2020

I tried adding Amazon. They have a stringent policy against web scraping: every request returns the HTML page below.

To discuss automated access to Amazon data, please contact [email protected].
For information about migrating to our APIs, refer to our Marketplace APIs at https://developer.amazonservices.com/ref=rm_c_sv, or our Product Advertising API at https://affiliate-program.amazon.com/gp/advertising/api/detail/main.html/ref=rm_c_ac for advertising use cases.

Enter the characters you see below
Sorry, we just need to make sure you're not a robot. For best results, please make sure your browser is accepting cookies
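
A scraper could at least recognize this page instead of trying to parse it as search results. The sketch below is one way to do that, assuming the marker phrases stay as quoted above; the function name `is_robot_check` and the marker list are hypothetical, and the real page may use different wording or a typographic apostrophe.

```python
# Phrases copied from the blocked-access page quoted above.
ROBOT_CHECK_MARKERS = (
    "To discuss automated access to Amazon data",
    "Enter the characters you see below",
    "make sure you're not a robot",
)


def is_robot_check(html: str) -> bool:
    """Return True if the HTML looks like the robot-check page."""
    text = html.lower()
    # Case-insensitive substring match against each known marker phrase.
    return any(marker.lower() in text for marker in ROBOT_CHECK_MARKERS)
```

A caller could then raise a clear error or skip parsing when this returns `True`, rather than returning an empty result list.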

@MeNsaaH
Member Author

MeNsaaH commented Apr 22, 2020

Wow, that's some serious stuff. Maybe look into which request headers can be passed, @devajithvs.

@devajithvs
Contributor

devajithvs commented Apr 25, 2020

Tried that too. Copying the browser's headers exactly didn't work either. I guess they have some other mechanism to prevent scraping.
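
For reference, the header approach described here could look roughly like the sketch below, using only the standard library. The search URL pattern, the header values, and the name `build_search_request` are assumptions for illustration, not the code that was actually tried. The request is only built, not sent, and as noted above, browser-like headers alone did not get past the check.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Illustrative browser-like headers; real browsers send more, and these
# values are made up for the example.
BROWSER_HEADERS = {
    "User-Agent": "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0",
    "Accept": "text/html,application/xhtml+xml",
    "Accept-Language": "en-US,en;q=0.9",
}


def build_search_request(query: str) -> Request:
    """Build (but do not send) a search request with browser-like headers."""
    # Assumed URL pattern: the search term goes in the "k" query parameter.
    url = "https://www.amazon.com/s?" + urlencode({"k": query})
    return Request(url, headers=BROWSER_HEADERS)


request = build_search_request("usb cable")
```

Passing `request` to `urllib.request.urlopen` would send it; the response could still be the robot-check page quoted earlier.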

@MeNsaaH
Member Author

MeNsaaH commented Apr 25, 2020

Alright, I think we'll have to look into using Selenium, though. Meanwhile, I'll try out some additional headers and see where that gets us.


4 participants