Update README.md #14

Open. Wants to merge 1 commit into base: master
4 changes: 2 additions & 2 deletions 1. Postgres ETL/README.md
@@ -5,7 +5,7 @@ This repo provides the ETL pipeline, to populate the sparkifydb database.

* This source of data will be useful in helping Sparkify reach some of its analytical goals, for example, finding out songs that have highest popularity or times of the day which is high in traffic.

- ## Database Design and ETL Pipeline
+ ## Database Design and ETL Pipeline Schema for Song Play Analysis
---
* For the schema design, the STAR schema is used as it simplifies queries and provides fast aggregations of data.

@@ -21,4 +21,4 @@ This repo provides the ETL pipeline, to populate the sparkifydb database.
---
* First, run create_tables.py to create the data tables using the schema design specified. If tables were created previously, they will be dropped and recreated.

- * Next, run etl.py to populate the data tables created.
+ * Next, run etl.py to populate the data tables created.
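
The run order described in the README lines above (create the tables first, then populate them) can be sketched as a small driver script. This is only a minimal sketch, assuming both scripts live in the same directory and run under the current Python interpreter; `run_pipeline` and `PIPELINE_STEPS` are hypothetical names used for illustration and are not part of the repo.

```python
import subprocess
import sys
from pathlib import Path

# Order matters: create_tables.py drops and recreates the tables,
# so it must finish before etl.py populates them.
PIPELINE_STEPS = ["create_tables.py", "etl.py"]


def run_pipeline(steps=PIPELINE_STEPS, workdir="."):
    """Run each script in order and stop at the first failure.

    Hypothetical helper for illustration; returns the scripts that ran.
    """
    completed = []
    for script in steps:
        path = Path(workdir) / script
        # check=True raises CalledProcessError on a non-zero exit code,
        # so etl.py never runs if table creation failed.
        subprocess.run([sys.executable, str(path)], check=True)
        completed.append(script)
    return completed
```

Calling `run_pipeline()` from the directory that contains the two scripts should be equivalent to running them by hand one after the other, with the added guarantee that a failure in the first step stops the second.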