Bookstore Web scraping

This is a Pyhton based web scraping project for machine learning portfolio. This project is associated with https://books.toscrape.com/ website which is specially design for training web scraping.

In this project I have built a mechanism to collect information about every book in the website, scraping through pagination.

Finally, I have made some conclusions about the data I have collected, using graphical representations like charts and graphs.

Installation

There are two main libraries that I have used for this project.

  # this will install 'requests' library
  !pip install requests

  # this command will install BeutifulSoup4
  !pip install bs4

🏆 Lessons Learned

Usage of Requests library to extract text content from a webpage.
Usage of BeautifulSoup4 library to filter-out the data we need from html/xml content.
Preparing a DataFrame using extracted data from webpage
Deriving some conclusions from scraped data

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
BookStore_WEB_SCRAPING_PROJECT.ipynb		BookStore_WEB_SCRAPING_PROJECT.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bookstore Web scraping

Installation

🏆 Lessons Learned

About

Releases

Packages

Languages

tharangachaminda/bookstore_webscraping

Folders and files

Latest commit

History

Repository files navigation

Bookstore Web scraping

Installation

🏆 Lessons Learned

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages