Automated Cancer Diagnostics via Analysis of Optical and Chemical Images by Deep and Shallow Learning

This is the repository for the paper "Automated cancer diagnostics via analysis of optical and chemical images by deep and shallow learning".

Deep Learning

Pre-processing of training data (WSI)

For the pre-processing of the training data, the default pipeline presented in the paper "Giunchiglia, V., McKenzie, J., and Takats Z., "WSIQC: whole slide images’ pre-processing pipeline for quality control assessment and AI-based data analysis", in preparation, 2022" was used. The code will be available at this link https://github.com/valegiunchiglia/wsi_pre_processing.

Pre-processing of test data (TMA FFPE)

The main pre-processing steps of FFPE images were:

Automated splitting of the TMA image into patches each containing one core
Detection and removal of the background from high resolution images
Tiling of high resolution images
Selection of the tiles containing less than a specific propotion of background
Preparation of the dictionaries to input into the neural network

Input data

The test dictionary must have the following keys:

slides: list of paths to the images
grid: list of list of tuples of (x, y) coordinates
targets: list of targets (either 1 or 0)
h5files: list of paths to the h5 files (one for each image)

The probability dictionary must have the following keys:

probs: list of probabilities obtained as output of the ML model
slideIDX: list of indices that identify to which image each tile belongs (tiles from the same image will have the same index)

Basic usage

# Create the environment, and then activate it
$ conda create --name tma python=3.7.1 ipykernel
$ conda activate tma


# First command pre-processing pipeline

python3 pre_processing_tma_ffpe.py \
  --path_to_slides [path/to/images] \
  --output [path/to/output/directory] \
  --output_h5 [path/to/output/directory/h5files]
 
Optional arguments:
--thresh_back                Upper threshold of proportion of background allowed in a tile
--tile_size                  Size of tiles extracted from the Whole Slide Image


# First command for the calculation of performance metrics

python3 confusion_matrix_ffpe.py \
  --path_to_predictions [path/to/output/predictions] \
  --test_dictionary [path/to/test/dictionary] \
  --output [path/to/output/directory] \
  --path_to_images [path/to/images]

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.gitignore		.gitignore
README.md		README.md
confusion_matrix_ffpe.py		confusion_matrix_ffpe.py
constants.py		constants.py
environment.yml		environment.yml
pre_processing_tma_ffpe.py		pre_processing_tma_ffpe.py
tma_coords.py		tma_coords.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automated Cancer Diagnostics via Analysis of Optical and Chemical Images by Deep and Shallow Learning

Deep Learning

Pre-processing of training data (WSI)

Pre-processing of test data (TMA FFPE)

Input data

Basic usage

About

Releases

Packages

Languages

valegiunchiglia/TMA-Analysis

Folders and files

Latest commit

History

Repository files navigation

Automated Cancer Diagnostics via Analysis of Optical and Chemical Images by Deep and Shallow Learning

Deep Learning

Pre-processing of training data (WSI)

Pre-processing of test data (TMA FFPE)

Input data

Basic usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages