This project provides a lightweight, flexible ETL (Extract, Transform, Load) pipeline implemented in Python. It demonstrates core ETL principles: extracting data from sources such as CSV files, JSON files, and APIs, applying transformations, and loading the processed data into a target destination.
- Data extraction from multiple sources (CSV, JSON, APIs)
- Flexible data transformation capabilities
- Support for multiple destination formats
- Logging and error handling
- Configurable pipeline stages
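
The features above can be illustrated with a minimal, standard-library-only sketch. It is not this project's actual code: every name in it (`extract_csv`, `normalize_name`, `parse_age`, `run_pipeline`, `load_json`, `SAMPLE_CSV`) is hypothetical and chosen only for illustration. It assumes a pipeline is an ordered list of transform functions, and that records failing a transform are logged and skipped.

```python
import csv
import io
import json
import logging
from typing import Any, Callable, Dict, Iterable, Iterator, List

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("etl_sketch")  # hypothetical logger name

Record = Dict[str, Any]
Transform = Callable[[Record], Record]


def extract_csv(text: str) -> Iterator[Record]:
    """Extract: yield one dict per CSV row, keyed by the header line."""
    yield from csv.DictReader(io.StringIO(text))


def normalize_name(record: Record) -> Record:
    """Transform: trim whitespace and title-case the 'name' field."""
    return {**record, "name": record["name"].strip().title()}


def parse_age(record: Record) -> Record:
    """Transform: convert 'age' to int (raises ValueError on bad input)."""
    return {**record, "age": int(record["age"])}


def run_pipeline(records: Iterable[Record], transforms: List[Transform]) -> List[Record]:
    """Apply each transform in order; log and skip records that fail."""
    loaded: List[Record] = []
    for record in records:
        try:
            for transform in transforms:
                record = transform(record)
        except (KeyError, ValueError) as exc:
            logger.warning("Skipping record %r: %s", record, exc)
            continue
        loaded.append(record)
    return loaded


def load_json(records: List[Record]) -> str:
    """Load: serialize the processed records to a JSON string."""
    return json.dumps(records, indent=2)


# Sample input is made up for this sketch; the second row is deliberately invalid.
SAMPLE_CSV = "name,age\n  ada lovelace ,36\nbob,not-a-number\n"

# The list of transforms is the "configurable stages" part of this sketch.
result = run_pipeline(extract_csv(SAMPLE_CSV), [normalize_name, parse_age])
output = load_json(result)
print(output)
```

Passing the stages as a plain list keeps them easy to reorder or swap, and catching errors per record means one malformed row produces a log warning rather than aborting the whole run. A real pipeline may prefer to fail fast or route bad records elsewhere.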
- Python 3.8+
- pip (Python package manager)
- Clone the repository:
git clone https://github.com/rishipatel9/python-elt-project.git
cd python-elt-project
- Create a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows, use `venv\Scripts\activate`
- Install dependencies:
pip install -r requirements.txt