In this project, we will identify the characteristics of women who are more likely to develop cervical cancer and use Automatic Machine Learning H2O AutoML to make prediction on whether or not a certain customer would buy a term deposit. We will also use Explainable AI (XAI) methods such as Variable Importance Plot, Partial Dependence Plot, SHAP Summary Plot, and LIME to explain how each of our feature input affects our model prediction.
This Cervical Cancer (Risk Factors) dataset comes from UCI Machine Learning Repository. The data was collected at Hospital Universitario de Caracas in Caracas, Venezuela. It comprises demographic information, habits, and historic medical records of 858 patients.