dchasanidis/BDCC

MapReduce - Car Prices

Author

Dimitrios Chasanidis

Description

This repository contains solutions to a MapReduce assignment focused on analyzing car price data. It includes four tasks implemented in Python, designed to run in both standalone and distributed modes.
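For orientation, here is a minimal sketch of what one MapReduce task over car price data might look like in standalone mode. It is not taken from the notebook: the task (average selling price per make), the column names `make` and `sellingprice`, and the helper names `mapper`, `reducer`, and `run_standalone` are all assumptions made for illustration.

```python
import csv
import io
from itertools import groupby
from operator import itemgetter


def mapper(lines):
    """Emit (make, price) pairs; skip rows with a missing or non-numeric price.

    The column names "make" and "sellingprice" are assumed, not confirmed.
    """
    for row in csv.DictReader(lines):
        make = (row.get("make") or "").strip().lower()
        try:
            price = float(row.get("sellingprice") or "")
        except ValueError:
            continue
        if make:
            yield make, price


def reducer(pairs):
    """Average the prices per key; input must already be sorted by key."""
    for make, group in groupby(pairs, key=itemgetter(0)):
        prices = [price for _, price in group]
        yield make, sum(prices) / len(prices)


def run_standalone(lines):
    """Simulate map -> shuffle/sort -> reduce in a single process."""
    return dict(reducer(sorted(mapper(lines), key=itemgetter(0))))


# Tiny in-memory sample standing in for car_prices.csv.
sample = io.StringIO(
    "make,sellingprice\n"
    "Kia,20000\n"
    "BMW,30000\n"
    "kia,10000\n"
    "BMW,\n"
)
result = run_standalone(sample)
print(result)
```

The sort between `mapper` and `reducer` stands in for the shuffle phase that Hadoop performs in distributed mode, which is why the reducer can rely on all pairs for one key arriving together.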

Contents

  • Assignment1_Dimitrios_Chasanidis.ipynb: Jupyter notebook with all MapReduce tasks.

Notes

  • The input dataset (car_prices.csv) is not included in this repository because of its size (80 MB).
  • A download link for the dataset is provided inside the notebook.

Requirements

  • Python 3.x
  • Hadoop (for distributed mode)

This project is part of data processing coursework and follows the provided submission guidelines.

About

Big Data and Cloud Computing - Data Science MSc
