video-dialog-framework

A basic video dialog and video question answer framework. This repository proposed a simple encoder-decoder vqa and video-dialog framework. You can easily change your custom encoder or decoder by adding a new encoder-class or decoder-class. And this repository contains several simple encoders and decoders(multi-choice decoder and open-domain decoder).

Requirements

python2.7
redis

Usage

Download dataset.

Baidu Pan: link passwd: 7c9b

(or, Google driver: link )
Install
- install redis
- pip install -r requirements.txt
start redis-server (train database server)

redis-server redis.conf
Train
- write a configure file (reference to conf/)
- python tools/train.py -c [config-file-path] (for example, python tools/train.py -c conf/lf.yaml)
Evaluate

python tools/eval.py -c [config-file-path]

Remark

I only release a simple version of video-dialog-framework. Maybe the data preprocessing is a little bit slow. For accelerating the training process, you can convert the data to tf-record, and change the data-load code to read tf-record.
The GDecoder class is a very naive decoder, maybe you can add attention mechanism to decode more accurate answer.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
conf		conf
data		data
mvd		mvd
tools		tools
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

video-dialog-framework

Requirements

Usage

Remark

About

Releases

Packages

Languages

nilboy/video-dialog-framework

Folders and files

Latest commit

History

Repository files navigation

video-dialog-framework

Requirements

Usage

Remark

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages