This repo collects my experimental projects on Data Engineering.
Updated Mar 6, 2023 - Python
Automate Apache Spark in Hadoop with Airflow in Cloud
Data Engineering Capstone Project - Udacity Data Engineering Expert Track.
Keywords: Python, Airflow, AWS, S3, Redshift, ETL
Udacity project within the Data Engineer Nanodegree
Leo CDP - Customer Data Platform for Smart Business
Kaggle's 'Bike Sharing Demand' competition
Scripts and data for parade-db: a political issue tracker to support educated voting 📊 ☑️
PySpark Analysis from log files
This is one of the academic projects assigned in our Geographic Information System (GIS) course. Created by: Pranav Pandya (me) and Kartikey Hadiya. We sampled pollution emission data for Delhi, India. Pollution data was obtained from: https://data.gov.in/resources/real-time-air-quality-index-various-locations Pollution index data ca…
Constructing a protein fragment database in the context of Lyme disease.
A curated list of awesome data engineering resources using python
IEEE AIKE 2018 Conference Website
Analyzing Boston Airbnb Data using ML
Data Engineering (Udacity): Project 1 Data Modelling with PostgreSQL
Personal repository for the Data Engineering course, autumn semester 2020/2021, University of Tartu
Innovative Speech Interaction Systems with Complex Data Processing, Machine Learning and Spoken NLP Command Control