I used a support vector machine (SVM) model to classify UNESCO World Heritage Sites. The model was built using data from UNESCO, Wikipedia, the US National Park Service, the Canadian National Park Service, and TripAdvisor.
I used a PostgreSQL database on an AWS cloud server to store my data.
I built my model using scikit-learn. The final SVM model raised the F1 score from an initial baseline of 48% to 62%.
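
As a rough illustration of that comparison, here is a minimal sketch of scoring an SVM against a baseline with scikit-learn. It is not the code from the notebooks: the data is synthetic (`make_classification`), and the baseline choice (`DummyClassifier`), the RBF kernel, the scaling step, and all hyperparameters are assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic, imbalanced stand-in data (assumption: the real features differ).
X, y = make_classification(
    n_samples=400, n_features=10, n_informative=5,
    weights=[0.7, 0.3], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Baseline: predicts labels at random according to the training class ratio.
baseline = DummyClassifier(strategy="stratified", random_state=0)
baseline.fit(X_train, y_train)

# SVM: features are standardized first, since SVMs are sensitive to scale.
svm = make_pipeline(
    StandardScaler(),
    SVC(kernel="rbf", C=1.0, class_weight="balanced"),
)
svm.fit(X_train, y_train)

baseline_pred = baseline.predict(X_test)
svm_pred = svm.predict(X_test)

# F1 on the held-out test set for both models.
baseline_f1 = f1_score(y_test, baseline_pred)
svm_f1 = f1_score(y_test, svm_pred)
print(f"baseline F1: {baseline_f1:.2f}")
print(f"SVM F1:      {svm_f1:.2f}")
```

The scores printed here come from the synthetic data and will not match the 48% and 62% reported above.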
There are 3 notebooks in this repository:
- Get CSVs from PostgreSQL
- Cleaning the data
- Modeling
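
The first notebook's step, pulling query results out of the database into CSV files, could look roughly like the sketch below. It is an illustration under assumptions, not the notebook's code: `export_query_to_csv`, the `sites` table, and its columns are hypothetical, and the demo uses an in-memory SQLite database as a stand-in so it runs without a server. The function only relies on the standard Python DB-API, so a PostgreSQL connection (for example one opened with `psycopg2.connect`) should work the same way.

```python
import csv
import os
import sqlite3
import tempfile


def export_query_to_csv(conn, query, csv_path):
    """Run `query` on a DB-API connection and write the result to `csv_path`.

    Returns the number of data rows written (header excluded).
    """
    cur = conn.cursor()
    cur.execute(query)
    rows = cur.fetchall()
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        # cursor.description holds one entry per column; index 0 is the name.
        writer.writerow([col[0] for col in cur.description])
        writer.writerows(rows)
    cur.close()
    return len(rows)


# Demo with SQLite standing in for PostgreSQL (hypothetical table and rows).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sites (name TEXT, category TEXT)")
conn.executemany(
    "INSERT INTO sites VALUES (?, ?)",
    [("Example Site A", "Cultural"), ("Example Site B", "Natural")],
)

csv_path = os.path.join(tempfile.mkdtemp(), "sites.csv")
n_rows = export_query_to_csv(
    conn, "SELECT name, category FROM sites ORDER BY name", csv_path
)
conn.close()
print(f"wrote {n_rows} rows to {csv_path}")
```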
This is my blog post on the project.