The project to write web page parser using Scrapy. The project implements a parser for collecting version information PEP (Python Enhancement Proposals)
The parser collects information (Number, Name и Status) about PEP documents from website https://peps.python.org/ and saves in csv file (directory result). The format of file name: pep(%datetime%).csv
Also parser counts the number of documents with a certain status, counts the total number of PEP documents and saves this information in a csv file (directory result) The format of file name: status_summary(%datetime%).csv
- Clone the repository to the local machine
git clone [email protected]:Andrey-Kugubaev/scrapy_parser_pep.git
- Install and activate the virtual environment
python -m venv venv
orpython3 -m venv venv
, thensource venv/Scripts/activate
orsource venv/bin/activate
- Install Dependencies
pip install -r requirements.txt
- Run parsers
scrapy crawl pep