Skip to content

Web scraping script for Google's search results page, built using Python and Scrapy.

License

Notifications You must be signed in to change notification settings

Muyoouu/google-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation


google-scraper

Python and Scrapy-based Google's search result scraping software

About The Project

Scraping data from Google search results presents two significant challenges. First, Google's search results are in a constant state of flux, making it difficult to maintain consistent and accurate data extraction. Second, Google has implemented stringent bot detection measures, creating a substantial obstacle to web scraping.

To address these challenges, I've developed specialized web scraping software with the following key features:

  1. Dynamic Web Page Navigation: My script includes instructions tailored to navigate Google's dynamic web pages effectively, allowing it to parse the ever-changing HTML content.

  2. Bypassing Bot Detection Measures: I've integrated proxy network services into the script to ensure that my scraping activities remain undetected as a bot by Google.

For a complete project demo, please visit the Google Scraper Notion Page.

(back to top)

Built With

  • Python
  • Scrapy
  • Pandas

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Contact

Musa Yohanes - [email protected]

Project Link: https://github.com/Muyoouu/google-scraper

(back to top)

About

Web scraping script for Google's search results page, built using Python and Scrapy.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages