Skip to content

This repository contains files from a client project aimed at scraping dentists' information across Indonesia.

Notifications You must be signed in to change notification settings

darren7753/dentists_data_collection

Repository files navigation

Scraping Dentists' Information

This project was commissioned by a client on 2023-09-26. If you're interested in similar work, check out my freelance data analyst profile on Fastwork.

Objective

The aim of this project was to scrape dentists' information across Indonesia, specifically focusing on their name, address, and phone number. The data was sourced from the following links:

Implementation

To carry out the scraping process, I utilized Selenium. Since each link presents its own unique format, especially the fifth link which is vastly different from the others, I crafted a dedicated script for each source. For instance, scraping_1_link.py corresponds to the first link, automated by scraping_1_link.yml for GitHub Actions and the data is saved in the Data folder. This pattern was followed for the remaining links. Post-scraping, all data collected from the individual sources was unified into a single Excel file, ensuring a comprehensive dataset.

Thank you for reviewing this repository. Please don't hesitate to reach out for further information or collaboration opportunities.

About

This repository contains files from a client project aimed at scraping dentists' information across Indonesia.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages