Skip to content

giacomolamberti90/CS229_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Image-Caption Retrieval

Junyang Qian & Giacomo Lamberti

(CS229 Machine Learning)

The repository contains:

  • code: directory containing all the modules of the code:

      1. main.py              -> call the trainer
      
      2. trainer.py           -> train the models
      
      3. datasource.py        -> define dataset as iterator
      
      4. dataset.py           -> load the dataset
      
      5. evaluation.py        -> compute the evaluation metrics
      
      6. eval_only.py         -> perform evaluation (no training)
      
      7. eval_custom_input.py -> read an interactive caption and pull out the top 10 related images (need the raw pictures to work!)
    
  • results: directory containing text files with a summary of the various results.

      1. results_glove_mse_margin_01.txt      -> summary of training of GRU+GloVe model
      
      2. results_glove_LSTM_mse_margin_01.txt -> summary of training of LSTM+GloVe model
      
      3. acc_len.txt                          -> R@10 vs length of the caption for 3 models
    

The dataset (1 GB), including 10-crop VGG19 features, can be downloaded by running:

wget http://www.cs.toronto.edu/~vendrov/order/coco.zip

The code can be run by going to the code directory and typing: python main.py

However, the code won't run without data.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages