Simple crawler made with scrapy and TorIpChanger package

Santhin/TorScrapy

A simple app for scraping data from Gumtree.

🧐 About

This project was created for learning purposes, to explore how to combine the Scrapy framework with TorIpChanger.
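One common way to combine the two is a downloader middleware that routes every request through a local proxy in front of Tor and asks for a new exit IP every few requests. The sketch below is illustrative only and is not taken from this repository's `middlewares.py`: the class name `RotatingTorProxyMiddleware`, the proxy address `http://127.0.0.1:8118`, and the `rotate_every` parameter are assumptions. The `ip_changer` argument is any object exposing a `get_new_ip()` method, which is the method the `toripchanger` package's `TorIpChanger` class provides.

```python
class RotatingTorProxyMiddleware:
    """Scrapy-style downloader middleware (hypothetical sketch).

    Sends each request through a local HTTP proxy and requests a new
    Tor exit IP after every `rotate_every` requests.
    """

    def __init__(self, ip_changer, proxy_url="http://127.0.0.1:8118", rotate_every=10):
        # ip_changer: any object with a get_new_ip() method,
        # e.g. an instance of toripchanger.TorIpChanger.
        self.ip_changer = ip_changer
        # proxy_url: assumed address of the HTTP proxy in front of Tor.
        self.proxy_url = proxy_url
        self.rotate_every = rotate_every
        self._requests_count = 0

    def process_request(self, request, spider):
        self._requests_count += 1
        if self._requests_count > self.rotate_every:
            # Threshold exceeded: reset the counter and switch exit IP.
            self._requests_count = 0
            self.ip_changer.get_new_ip()
        # Scrapy's HTTP proxy support reads the proxy from request.meta.
        request.meta["proxy"] = self.proxy_url
        # Returning None lets Scrapy continue processing the request.
        return None
```

In a real Scrapy project the middleware would still need to be listed in `DOWNLOADER_MIDDLEWARES` in `settings.py`, and a `from_crawler` classmethod would typically construct the IP changer, since Scrapy instantiates middlewares itself.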

🏁 Getting Started

Prerequisites

  • Docker Desktop

Project structure

.
├── docker-compose.yml
├── LICENSE
├── README.md
└── src
    ├── crawler
    │   ├── __init__.py
    │   ├── items.py
    │   ├── middlewares.py
    │   ├── pipelines.py
    │   ├── settings.py
    │   └── spiders
    │       ├── __init__.py
    │       ├── mieszkania2.py
    │       └── quotes_spider.py
    ├── Dockerfile
    ├── go_spider.py
    ├── scrapy.cfg
    └── tests
        └── ipchanger_works.py

Installing

Clone the repository:

git clone https://github.com/Santhin/TorScrapy.git

To run the crawler, type:

docker-compose up

🔧 Running the tests

To check that the Tor IP changer is working, uncomment the test line in the Dockerfile.
Example output:

[Screenshot: example output of the IP changer test]
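The contents of `tests/ipchanger_works.py` are not shown here, so the following is only a sketch of what such a check could look like. The function name `ip_changes` and the `attempts` parameter are hypothetical. It calls a supplied function several times and reports whether more than one distinct IP came back; with the `toripchanger` package, the bound method `TorIpChanger(...).get_new_ip` could be passed in.

```python
def ip_changes(get_new_ip, attempts=3):
    """Return (changed, seen) for `attempts` calls to get_new_ip().

    get_new_ip: zero-argument callable returning the current exit IP
                as a string (hypothetical interface for this sketch).
    changed:    True if at least two distinct IPs were observed.
    seen:       list of the IPs in the order they were returned.
    """
    seen = [get_new_ip() for _ in range(attempts)]
    return len(set(seen)) > 1, seen


if __name__ == "__main__":
    # Stand-in for a real IP changer so the sketch runs without Tor.
    fake_ips = iter(["198.51.100.1", "198.51.100.2", "198.51.100.3"])
    changed, seen = ip_changes(lambda: next(fake_ips))
    print("IP changed:", changed)
    for ip in seen:
        print(" ", ip)
```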

🛠️ Todo

  • Add startup control for the TorIpChanger container in docker-compose

⛏️ Built Using

🎉 Acknowledgements