This repository contains:
- A modified version of the MoCo implementation, adapted to the CheXpert dataset
- A modified version of the original implementation of the CheXpert paper
A preprint of this work is available on arXiv:
```bibtex
@article{sowrirajanmoco,
  title={MoCo-CXR: MoCo Pretraining Improves Representation and Transferability of Chest X-ray Models},
  author={Sowrirajan, Hari and Yang, Jingbo and Ng, Andrew Y and Rajpurkar, Pranav}
}
```
This work has been presented at:
- ACM Conference on Health, Inference and Learning (CHIL 2020) workshop
- Medical Imaging with Deep Learning (MIDL 2021)
Contrastive learning is a form of self-supervision that can leverage unlabeled data to produce pretrained models. While contrastive learning has demonstrated promising results on natural image classification tasks, its application to medical imaging tasks like chest X-ray interpretation has been limited. In this work, we propose MoCo-CXR, an adaptation of the contrastive learning method Momentum Contrast (MoCo), to produce models with better representations and initializations for the detection of pathologies in chest X-rays. In detecting pleural effusion, we find that linear models trained on MoCo-CXR-pretrained representations outperform those without MoCo-CXR-pretrained representations, indicating that MoCo-CXR-pretrained representations are of higher quality. End-to-end fine-tuning experiments reveal that a model initialized via MoCo-CXR-pretraining outperforms its non-MoCo-CXR-pretrained counterpart. We find that MoCo-CXR-pretraining provides the most benefit with limited labeled training data. Finally, we demonstrate similar results on a target tuberculosis dataset unseen during pretraining, indicating that MoCo-CXR-pretraining endows models with representations and transferability that can be applied across chest X-ray datasets and tasks.
MoCo-CXR uses momentum contrast as an unsupervised pretraining method. This method maximizes agreement between augmentations of the same image while increasing the distance to momentum-weighted negative embeddings.
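The contrastive objective can be illustrated with a minimal MoCo-style InfoNCE sketch in PyTorch; names such as `q`, `k`, and `queue` are illustrative and do not correspond to exact variables in this repository:

```python
import torch
import torch.nn.functional as F

def moco_infonce_loss(q, k, queue, temperature=0.07):
    """Minimal MoCo-style InfoNCE loss (illustrative sketch).

    q:     (N, C) query embeddings from the online encoder
    k:     (N, C) key embeddings from the momentum encoder (positives)
    queue: (C, K) embeddings of past keys, serving as negatives
    """
    q = F.normalize(q, dim=1)
    k = F.normalize(k, dim=1)

    # Positive logits: similarity between each query and its own key.
    l_pos = torch.einsum("nc,nc->n", q, k).unsqueeze(-1)  # (N, 1)
    # Negative logits: similarity between each query and the queue.
    l_neg = torch.einsum("nc,ck->nk", q, queue)           # (N, K)

    logits = torch.cat([l_pos, l_neg], dim=1) / temperature
    # The positive key always sits at index 0.
    labels = torch.zeros(logits.shape[0], dtype=torch.long, device=q.device)
    return F.cross_entropy(logits, labels)
```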
MoCo-CXR starts from ResNet-initialized weights and is then trained in an unsupervised manner. Supervised learning is subsequently performed at different label fractions on the CheXpert dataset and the Shenzhen dataset.
Comparison of MoCo-CXR performance against the ResNet-initialized baseline when only the linear layers are fine-tuned.
Comparison of MoCo-CXR performance against the ResNet-initialized baseline when all layers are fine-tuned.
- https://storage.googleapis.com/moco-cxr/mnn-00001.pth.tar
- https://storage.googleapis.com/moco-cxr/r8w-00001-v2.pth.tar (slightly different but produces similar result as V1)
- https://storage.googleapis.com/moco-cxr/r8w-00001.pth.tar
- https://storage.googleapis.com/moco-cxr/r8w-0001.pth.tar
- https://storage.googleapis.com/moco-cxr/r8w-001.pth.tar
- https://storage.googleapis.com/moco-cxr/r5w-00001.pth.tar
- https://storage.googleapis.com/moco-cxr/d1w-00001.pth.tar
Note that these checkpoints follow MoCo's implementation. To reuse them in an ImageNet-like training pipeline, you will have to "hack" the checkpoint weights in a way similar to our model saver.
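As a rough illustration, the conversion usually amounts to loading the MoCo checkpoint and stripping the `module.encoder_q.` prefix from the backbone keys, following the pattern in MoCo's official linear-classification script. The snippet below is a sketch under that assumption; the exact key names and checkpoint layout may differ for these files:

```python
import torch
import torchvision.models as models

# Load a MoCo-style checkpoint (keys typically look like "module.encoder_q.conv1.weight").
checkpoint = torch.load("r8w-00001.pth.tar", map_location="cpu")
state_dict = checkpoint["state_dict"]

backbone_state = {}
for key, value in state_dict.items():
    # Keep only the query-encoder backbone; drop the MoCo-specific MLP/fc head.
    if key.startswith("module.encoder_q") and not key.startswith("module.encoder_q.fc"):
        backbone_state[key.replace("module.encoder_q.", "")] = value

# The renamed weights can now be loaded into a plain torchvision backbone.
model = models.resnet18(num_classes=1)  # adjust the head for your task
msg = model.load_state_dict(backbone_state, strict=False)
print("Missing keys (expected: the final fc layer):", msg.missing_keys)
```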
Note that the above naming drops the decimal point, that is, 00001 means 0.0001 = 1e-4. Our experiments were conducted on Stanford's SLURM cluster. For reference, the training script used is here. Alternatively, if you are running on a "vanilla" machine, you can reference this script. You can also reference a generation script if you would like to generate commands for different learning rates and/or backbone models.
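A minimal sketch of what such a command generator does is shown below; the script name, flags, and value grids are illustrative placeholders, not the exact arguments expected by the training code:

```python
# Hypothetical command generator: sweep learning rates and backbones.
learning_rates = [1e-2, 1e-3, 1e-4, 1e-5]
backbones = ["resnet18", "resnet50", "densenet121"]

for backbone in backbones:
    for lr in learning_rates:
        # Drop the decimal point from the learning rate, e.g. 0.0001 -> "00001",
        # mirroring the checkpoint naming convention described above.
        tag = f"{lr:.5f}".replace(".", "").rstrip("0") or "0"
        print(
            f"python train.py --arch {backbone} --lr {lr} "
            f"--save-dir checkpoints/{backbone}-{tag}"
        )
```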
We used splitting scripts like this one to split the data into training and validation sets. These scripts also generate the multiple draws used to produce confidence intervals for the evaluation of our semi-supervised approach.
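The idea behind the label-fraction draws can be sketched as follows; the file names, fractions, and number of draws are placeholders rather than the values used in our experiments:

```python
import pandas as pd

# Hypothetical CheXpert-style labels file; the real scripts read the official CSVs.
labels = pd.read_csv("train.csv")

label_fractions = [0.01, 0.1, 1.0]  # fraction of labeled training data to keep
num_draws = 10                       # independent draws for confidence intervals

for fraction in label_fractions:
    for draw in range(num_draws):
        # Each draw uses a different seed so the subsets are independent samples.
        subset = labels.sample(frac=fraction, random_state=draw)
        subset.to_csv(f"train_frac{fraction}_draw{draw}.csv", index=False)
```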
For the Shenzhen dataset, we used this script to convert the unpacked Shenzhen files into CheXpert's default format for easier experiment setup. Note that the actual CheXpert pipeline is a three-step process: training, model picking (selecting the best checkpoint), and evaluation. Each independent "draw" went through this process.
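A rough sketch of such a conversion is shown below; the directory layout, the filename convention (Shenzhen files ending in `_0` for normal and `_1` for tuberculosis), and the reduced CheXpert-style column set are assumptions and may need adjusting to match the actual script:

```python
import csv
from pathlib import Path

shenzhen_dir = Path("ChinaSet_AllFiles/CXR_png")  # assumed location of the unpacked images
output_csv = Path("shenzhen_chexpert_format.csv")

with output_csv.open("w", newline="") as f:
    writer = csv.writer(f)
    # Minimal CheXpert-style header with a single target column.
    writer.writerow(["Path", "Frontal/Lateral", "AP/PA", "Tuberculosis"])
    for image_path in sorted(shenzhen_dir.glob("*.png")):
        # Shenzhen filenames end in "_0" (normal) or "_1" (tuberculosis).
        label = 1 if image_path.stem.endswith("_1") else 0
        # "Frontal"/"AP" are placeholder view labels to satisfy the CheXpert format.
        writer.writerow([str(image_path), "Frontal", "AP", label])
```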