@@ -45,13 +45,13 @@ Check `README.md` under `data` for more details.
## Visiolinguistic Pre-training
-To train the model:
+Once you have extracted all the image features, train the model:
```
```
-Distributed Training:
+To train the model in a distributed setting:
```
```
@@ -60,10 +60,12 @@ Distributed Training:
We can directly use the Pre-trained ViLBERT model for zero-shot image retrieval tasks on Flickr30k.
-To evaluate on Flickr30k:
+First, update `features_h5path1` and `val_annotations_jsonpath` in `vlbert_task.yml` to load the Flickr30k test-set image features and JSON file (the default is the training features).
-```
-python
+Then, use the following command to evaluate the pre-trained 6-layer ViLBERT model (evaluation currently supports only a single GPU):