Skip to content

HarshithReddy-Audipudi/amazon-sales-analysis

Repository files navigation

📦 Amazon USA Sales Analysis – DMQL Project (Milestone 2)

This project is a part of Milestone 2 for the DMQL course. It involves building a normalized PostgreSQL database, loading real-world Amazon sales data, writing complex SQL queries, optimizing performance using EXPLAIN ANALYZE, and optionally developing a Streamlit dashboard.


📁 Project Structure


✅ Features

  • ✅ PostgreSQL schema design with BCNF/3NF compliance
  • ✅ Sample and production data loading
  • ✅ Complex SQL queries: aggregation, filtering, JOINs
  • ✅ Optimization using EXPLAIN ANALYZE and indexes
  • ✅ Query performance screenshots
  • ✅ Bonus-ready: web dashboard setup (Streamlit supported)

🚀 How to Run

  1. Set up PostgreSQL (locally or on a cloud instance)
  2. Run create.sql to create all tables
  3. Place .csv files into an accessible path (e.g. /tmp/ on Mac/Linux)
  4. Edit paths in load.sql and run it to populate the tables
  5. Execute queries from solutions.sql in PgAdmin or terminal
  6. Use explain_queries.sql to analyze query performance
  7. View saved screenshots in /screenshots/

📊 Bonus (Optional Web App)

You can optionally implement a Streamlit dashboard under /webapp/ to:

  • Show top-selling products
  • Visualize category revenue
  • Track monthly sales trends

This will earn bonus points if deployed and included in your final submission.


👨‍💻 Contributors

  • Harshith Kumar Reddy Audipudi (haudipud)
  • Lohith Vattikuti (lvattiku)
  • Sohith Sai Malyala (sohithsa)

📅 Semester

Spring 2025 – CSE 460/560: Data Models and Query Languages
Instructor: Dr. Sreyasee Das Bhattacharjee


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •