This project involves building and training a machine learning model using PySpark to predict customer churn. The dataset contains customer attributes and churn status, and it goes through data exploration and preprocessing steps initially. Then, machine learning algorithms such as logistic regression and gradient boosting machine (GBM) are used to build the model. Finally, various metrics are used to evaluate the model's performance, and model hyperparameters are tuned using CrossValidator.
-
Notifications
You must be signed in to change notification settings - Fork 0
mesudepolat/Churn-PySpark
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Churn Prediction using PySpark
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published