Dimitrios Chasanidis
This repository contains solutions to a MapReduce assignment focused on analyzing car price data. It includes four tasks implemented in Python, designed to run in both standalone and distributed modes.
Assignment1_Dimitrios_Chasanidis.ipynb
: Jupyter notebook with all MapReduce tasks.
- The input dataset (
car_prices.csv
) is not included in this repository, because it is too big (80MB) for a GitHub repository. - A download link for the dataset is provided inside the notebook.
- Python 3.x
- Hadoop (for distributed mode)
This project is part of a data processing coursework and follows the provided submission guidelines.