Author: Mark Bauer
Welcome to the beginner's guide to DuckDB's Python client! This tutorial was crafted during my own journey of acquainting with DuckDB's Python client and aims to provide newcomers with a basic foundation in utilizing the API. The duckdb-python-basics notebook is based almost entirely on the official DuckDB Python documentation but in a Jupyter Notebook layout. The data-analysis notebook provides a sample analysis of Green Infrastructure projects in NYC.
While this guide serves as a valuable resource, I encourage users to complement their learning with the official documentation available on DuckDB's website for a comprehensive understanding.
For more advanced data analytics projects that utilize DuckDB, check out:
- Analyzing FEMA's National Flood Insurance Program (NFIP) Data With DuckDB
- FEMA Disaster Declarations and Public Assistance Data Analysis
- MTA Subway Origin-Destination Ridership Estimate for 2023
- Explore the beginner's notebook: duckdb-python-basics.ipynb
- A sample data analysis project examining Green Infrastructure projects in NYC: data-analysis.ipynb
- Download data: data-download.ipynb
NYC DEP Green Infrastructure Data: https://data.cityofnewyork.us/Environment/DEP-Green-Infrastructure/spjh-pz7h
This dataset contains the locations and detailed information of green infrastructure practices in NYC neighborhoods built primarily through NYC Green Infrastructure Program initiatives. Green infrastructure (GI) collects stormwater from streets, sidewalks, and other hard surfaces before it can enter the sewer system or cause local flooding. The GI practice data contained in this dataset includes the location, program area, status, and type of GI.
- Official DuckDB Documentation: https://duckdb.org/
- DuckDB Python client guide: https://duckdb.org/docs/api/python/overview
Feel free to reach out for further discussions.
- LinkedIn: markebauer
- GitHub: mebauer
- Portfolio: mebauer.github.io