In this folder, we have all of the datasets we managed to find online. This folder is further subdivided into two folders: classification and regression.
The classification folder has the next datasets:
- Heart: heart.dat (Link)
- Breast cancer: dataset_13_breast-cancer.arff (Link)
- Haberman: haberman.data (Link)
- Ionosphere: ionoshpere.data (Link)
- Diabetes: diabetes.csv (Link)
- German credit: SouthGermanCredit.asc (Link)
- Juvenile: juvenile.xpt (Link)
- Recidivism: compas-scores-two-years.csv (Link)
The regression folder has the next datasets:
- Geographical music: geographical_music.tsv (Link)
- Red wine: winequality-red.csv (Link)
- Abalone: abalone.data (Link)
- Satellite image: satellite_image.tsv (Link)
- CA housing: ca_housing.data (Link)
Python package scikit-learn
was used for the next three regression datasets: