Skip to content

EDA and Machine Learning Models in R and Python (Regression, Classification, Clustering, SVM, Decision Tree, Random Forest, Time-Series Analysis, Recommender System, XGBoost)

Notifications You must be signed in to change notification settings

ashish-kamboj/Data-Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Packages/Libraries used for data analyis and building Machine learning models

Data Operations Python R
Data Manipulation Pandas dplyr, plyr, tidyr, stringr, data.table, lubridate(for date manipulation)
Data Visualization matplotlib, seaborn ggplot2, cowplot, ggthemes, scales
Recommender Model recommenderlab
Text Mining nltk, spaCy tm, tidyverse
ML Models scikit-learn, PyCaret randomForest, rpart, caret, lm, glm, forecast, tseries, kernlab


ML and EDA Projects

Projects Algorithms Programming Languages
Abnormal Blood Pressure Classification Logistic Regression, Decision Trees, Random Forest, XGBoost, LightGBM and other classification algorithms Python
AirBnB Price Prediction XGBoost R
Amaze Payment Solution EDA EDA R
Amazon Marketplace Best Sellers Identification K-Means Python
Beer Recommendation System Collaborative Filtering, Content-Based Filtering R
Breast Cancer Prediction AdaBoost Python
Car Pricing Model Linear Regression R
Credit Card Defaulter Random Forest R, Python
Credit Risk Analysis EDA R
Credit Worthiness For Rural India Linear/Lasso/Ridge/Elastic Net Regression, Decision Tree Regressor, Random Forest Regressor and other Regression algorithms Python
Customer Segmentation K-Means, hierarchical clustering R
Digital Media Company Viewership Prediction Python
Email Classification Linear SVM R, Python
Employee Attrition Model Logistic Regression R
Global Investment Trends EDA R
Handwritten Digit Recognition SVM (Linear and RBF) R
Heart Disease Classification Decision Tree Python
Housing Price Prediction Linear Regression(OLS) Python
Letter Recognition SVM (Linear and RBF) Python
Loan Defaulter-EDA EDA R
Monthly Income Decision Tree R
Movie Recommendation System Collaborative Filtering, Content-Based Filtering R
RTO Prediction Logistic Regression, Decision Trees, Random Forest, XGBoost, LightGBM and other classification algorithms Python
SMS Classification Multinomial and Bernoulli Naive Bayes Python
Saavn_Ecomm_Ads_Segmentation Clustering (k-prototype) R
Sales and Demand Forecasting Time-Series (ARMA, ARIMA) R
Telecom Churn Model Logistic Regression R, Python
Transaction-data-analysis-and-prediction Time-Series(ARIMA) R
Uber Supply-Demand Gap EDA R


Additional Reading

Click to expand!

Related Mathematics

Model Evaluation (here)

  • Regression
    • R-squared/Adj. R-squared
    • Root Mean Squared Error(RMSE) / Mean Squared Error
    • Mean Absolute Error(MAE)
  • Classification (here)
    • Accuracy, Precision, and Recall
    • Log Loss/Binary Crossentropy
    • Categorical Crossentropy
    • Confusion Matrix
    • F1 Score
    • AUC

Releases

No releases published

Packages

No packages published