Skip to content
/ sml Public

This machine learning tool mainly focus on using `Tensorflow Extended` library to train machine learning model using data from various data storage.

Notifications You must be signed in to change notification settings

evolv34/sml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SML - Simple Machine learning

This machine learning pipeline tool focuses mainly on using Tensorflow Extended library to train machine learning model using data from various data storage.

Architecture

Alt Image text

Description

Notebook files are stored in /notebooks folder

File Description
IEEE-CIS-Fraud-Detection-preprocessor.ipynb pyspark preprocessor notebook
IEEE-CIS-Fraud-Detection-Train-TF.ipynb Tensorflow extended model training and publishing code.
IEEE-CIS-Fraud-Detection-Score-Spark.ipynb pyspark score notebook.

Installation

The stack is deployed using docker and docker-compose. docker and docker-compose are prerequisite.

  docker-compose -f sml.yml up -d

Test Scenario

This pipeline used IEEE-CIS Fraud Detection data from kaggle. And in the first iteration it was able to achive considerable ok score.

  • features were selected based on backward elemination technique.

Alt Image text

Note:

Model improvement is beyond the scope of this repository.

About

This machine learning tool mainly focus on using `Tensorflow Extended` library to train machine learning model using data from various data storage.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published