Image-Captioning

Implementation of the paper Show, Attend and Tell, with beam search used to generate captions.
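As an illustration of the decoding step, here is a minimal beam search sketch in plain Python. It is not the code from this repository: `beam_search`, `toy_step`, and the `TOY` probability table are hypothetical names, and the toy table stands in for the trained decoder, which would normally supply next-token probabilities.

```python
import math
from typing import Callable, Dict, List, Tuple


def beam_search(
    step_fn: Callable[[List[str]], Dict[str, float]],
    start_token: str = "<start>",
    end_token: str = "<end>",
    beam_size: int = 3,
    max_len: int = 20,
) -> List[str]:
    """Return the highest-scoring token sequence found by beam search.

    step_fn maps a partial sequence to next-token probabilities.
    """
    # Each hypothesis is (cumulative log-probability, tokens so far).
    beams: List[Tuple[float, List[str]]] = [(0.0, [start_token])]
    finished: List[Tuple[float, List[str]]] = []

    for _ in range(max_len):
        candidates = []
        for score, tokens in beams:
            for token, prob in step_fn(tokens).items():
                if prob > 0.0:
                    candidates.append((score + math.log(prob), tokens + [token]))
        if not candidates:
            break
        # Keep only the beam_size best expansions.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, tokens in candidates[:beam_size]:
            if tokens[-1] == end_token:
                finished.append((score, tokens))
            else:
                beams.append((score, tokens))
        if not beams:
            break

    # Fall back to unfinished hypotheses if nothing reached end_token.
    best = max(finished or beams, key=lambda c: c[0])
    return best[1]


# Hypothetical toy "decoder": a lookup table of next-token probabilities.
TOY = {
    ("<start>",): {"a": 0.6, "the": 0.4},
    ("<start>", "a"): {"dog": 0.5, "cat": 0.5},
    ("<start>", "the"): {"dog": 0.9, "cat": 0.1},
}


def toy_step(tokens: List[str]) -> Dict[str, float]:
    # Any sequence not in the table ends immediately.
    return TOY.get(tuple(tokens), {"<end>": 1.0})


print(beam_search(toy_step, beam_size=3))
print(beam_search(toy_step, beam_size=1))
```

In this toy table a beam size of 1 (greedy decoding) commits to "a" first, while a beam size of 3 keeps "the" alive long enough to find the more probable caption overall.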

BLEU score of 22 with a beam size of 3.
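For readers unfamiliar with the metric, the sketch below shows how a sentence-level BLEU score can be computed from clipped n-gram precisions and a brevity penalty. It is a simplified illustration, not the evaluation code used here: `ngrams` and `sentence_bleu` are hypothetical helpers, there is no smoothing, and reported scores are usually computed over a whole corpus and scaled to 0-100.

```python
import math
from collections import Counter
from typing import List


def ngrams(tokens: List[str], n: int) -> Counter:
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate: List[str], references: List[List[str]], max_n: int = 4) -> float:
    """Unsmoothed sentence-level BLEU in the range [0, 1]."""
    if not candidate:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0
        # For each n-gram, the largest count seen in any single reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        # Clip candidate counts so repeated words are not over-rewarded.
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if clipped == 0:
            return 0.0
        log_precision_sum += math.log(clipped / total) / max_n
    # Brevity penalty, using the reference length closest to the candidate.
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_precision_sum)


candidate = "the cat sat on the mat".split()
reference = "the cat is on the mat".split()
print(sentence_bleu(candidate, [reference], max_n=2))
```

Without smoothing, a single n-gram order with no matches drives the whole score to zero, which is why the usage line above restricts the example to bigrams.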

Dataset

Flickr8k dataset: 8,000 images at 64x64 pixels.

Training

To start training, run:

python train.py

You can optionally use pre-trained embeddings.
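How the pre-trained embedding option is wired up in train.py is not shown here, so the following is only a sketch of the general idea: words found in a pre-trained table keep their vectors, and the rest are initialised randomly. `build_embedding_matrix` and the example vectors are hypothetical.

```python
import random
from typing import Dict, List


def build_embedding_matrix(
    vocab: List[str],
    pretrained: Dict[str, List[float]],
    dim: int,
    seed: int = 0,
) -> List[List[float]]:
    """Build one embedding row per vocabulary word.

    Words with a pre-trained vector of the right size reuse it;
    all other words get small random values.
    """
    rng = random.Random(seed)
    matrix = []
    for word in vocab:
        vector = pretrained.get(word)
        if vector is not None and len(vector) == dim:
            matrix.append(list(vector))
        else:
            matrix.append([rng.uniform(-0.1, 0.1) for _ in range(dim)])
    return matrix


vocab = ["<pad>", "dog", "cat"]
pretrained = {"dog": [1.0, 2.0]}  # made-up vectors for illustration
matrix = build_embedding_matrix(vocab, pretrained, dim=2)
print(matrix[1])
```

In a deep learning framework, the resulting matrix would typically be copied into the decoder's embedding layer before training starts.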

Monitor/Track training

To track training with Weights & Biases, change WANDB_TRACK = False to WANDB_TRACK = True.
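A flag like this is commonly used to make logging optional. The sketch below shows one way that could look, assuming the wandb package is only needed when tracking is enabled. `make_logger` and the project name are hypothetical, and the actual code in this repository may differ.

```python
WANDB_TRACK = False  # set to True to enable Weights & Biases tracking


def make_logger(track: bool):
    """Return a log(metrics, step) function that does nothing unless tracking is on."""
    if not track:
        return lambda metrics, step=None: None
    # Imported lazily so wandb is only required when tracking is enabled.
    import wandb

    wandb.init(project="image-captioning")  # hypothetical project name
    return lambda metrics, step=None: wandb.log(metrics, step=step)


log = make_logger(WANDB_TRACK)
log({"train_loss": 1.23}, step=0)  # no-op while WANDB_TRACK is False
```

With this pattern the training loop can call `log` unconditionally, and the flag decides whether anything is sent.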

Notebooks

The full training run and the metric evaluation are shown in this notebook.