This project is a part of Milestone 2 for the DMQL course. It involves building a normalized PostgreSQL database, loading real-world Amazon sales data, writing complex SQL queries, optimizing performance using EXPLAIN ANALYZE
, and optionally developing a Streamlit dashboard.
- ✅ PostgreSQL schema design with BCNF/3NF compliance
- ✅ Sample and production data loading
- ✅ Complex SQL queries: aggregation, filtering, JOINs
- ✅ Optimization using
EXPLAIN ANALYZE
and indexes - ✅ Query performance screenshots
- ✅ Bonus-ready: web dashboard setup (Streamlit supported)
- Set up PostgreSQL (locally or on a cloud instance)
- Run
create.sql
to create all tables - Place
.csv
files into an accessible path (e.g./tmp/
on Mac/Linux) - Edit paths in
load.sql
and run it to populate the tables - Execute queries from
solutions.sql
in PgAdmin or terminal - Use
explain_queries.sql
to analyze query performance - View saved screenshots in
/screenshots/
You can optionally implement a Streamlit dashboard under /webapp/
to:
- Show top-selling products
- Visualize category revenue
- Track monthly sales trends
This will earn bonus points if deployed and included in your final submission.
- Harshith Kumar Reddy Audipudi (haudipud)
- Lohith Vattikuti (lvattiku)
- Sohith Sai Malyala (sohithsa)
Spring 2025 – CSE 460/560: Data Models and Query Languages
Instructor: Dr. Sreyasee Das Bhattacharjee