This project demonstrates the generalizability of Deep Q-Learning for learning control policies directly from the visual output of different environments. The model learns an end-to-end mapping from rendered 2-D game frames to the next action (control signals such as move up or move right) so as to maximize the cumulative reward (score). During the learning phase, the model is trained while simultaneously making a prediction at every step (frame to action).
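At its core, Deep Q-Learning regresses a neural network Q(s, a) towards the Bellman target r + γ·max_a' Q(s', a') while choosing actions ε-greedily. The snippet below is a minimal, generic sketch of those two ideas; the function and parameter names are illustrative assumptions and are not taken from this repository's code.

```python
import numpy as np

def epsilon_greedy_action(q_values, epsilon, n_actions):
    """Pick a random action with probability epsilon, otherwise the greedy one."""
    if np.random.rand() < epsilon:
        return np.random.randint(n_actions)
    return int(np.argmax(q_values))

def q_learning_targets(rewards, next_q_values, dones, gamma=0.99):
    """Bellman targets r + gamma * max_a' Q(s', a'), cut off at episode end."""
    return rewards + gamma * (1.0 - dones) * next_q_values.max(axis=1)
```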
| Model | Performance | Episode 20 | Episode 200 | Train Score / episode |
|---|---|---|---|---|
| FCNN | Open TensorBoard | ![]() | ![]() | ![]() |
| CNN | Open TensorBoard | ![]() | ![]() | ![]() |
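The two models compared above are a fully connected network (FCNN) acting on a flattened observation and a convolutional network (CNN) acting on stacked image frames. The Keras sketch below only illustrates what such Q-networks typically look like; the layer sizes and names are assumptions, not the architectures used in this repository.

```python
import tensorflow as tf

def build_fcnn_q_network(obs_dim, n_actions):
    """Fully connected Q-network for a flat observation vector (assumed sizes)."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(obs_dim,)),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(n_actions),  # one Q-value per action
    ])

def build_cnn_q_network(frame_shape, n_actions):
    """Convolutional Q-network for stacked game frames (assumed sizes)."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=frame_shape),  # e.g. (84, 84, 4)
        tf.keras.layers.Conv2D(32, 8, strides=4, activation="relu"),
        tf.keras.layers.Conv2D(64, 4, strides=2, activation="relu"),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(256, activation="relu"),
        tf.keras.layers.Dense(n_actions),
    ])
```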
First, on your local machine, run:

```
python train_master.py
```
Note: Use a port-forwarding tool such as ngrok to expose the endpoint created by the master.

To monitor logs streamed from remote workers on your local machine, run:

```
tensorboard --logdir logs
```
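The worker scripts presumably write TensorBoard event files into the `logs` directory. The snippet below is a generic illustration of how per-episode scores could be logged so they show up in that dashboard; the writer path and tag name are assumptions, not this project's actual logging code.

```python
import tensorflow as tf

# Hypothetical example: write one scalar per episode into the logs directory.
writer = tf.summary.create_file_writer("logs/worker-1")
with writer.as_default():
    for episode, score in enumerate([12.0, 35.0, 50.0]):
        tf.summary.scalar("train_score_per_episode", score, step=episode)
writer.flush()
```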
Now, on each remote workstation, run:

```
python train_worker.py \
  --env <ENV_NAME> \
  --master-endpoint <MASTER_ENDPOINT> \
  --worker-name <WORKER_NAME>
```
To train using the CNN-based model, run:

```
python train_worker_cnn.py \
  --env <ENV_NAME> \
  --master-endpoint <MASTER_ENDPOINT> \
  --worker-name <WORKER_NAME>
```
Alternatively, run a remote worker from Google Colab: https://colab.research.google.com/github/ArjunInventor/Deep-Q-Learning-Agent/blob/master/train_worker.ipynb
To watch a trained model play, run:

```
python play.py --model <MODEL_PATH> --env <ENV_NAME>
```
When using a CNN-based model, run:

```
python play_cnn.py --model <MODEL_PATH> --env <ENV_NAME>
```
Add the `--save-gif` flag to save the gameplay as a GIF.
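Conceptually, playback loads the trained Q-network and follows the greedy policy in the chosen environment. The loop below is only an illustrative sketch of that idea, assuming a saved Keras model and the classic `gym` API; it is not the code in play.py, and the environment and model path shown are placeholders.

```python
import gym
import numpy as np
import tensorflow as tf

# Hypothetical evaluation loop: act greedily with respect to the learned Q-values.
env = gym.make("CartPole-v1")                    # stands in for <ENV_NAME>
model = tf.keras.models.load_model("model.h5")   # stands in for <MODEL_PATH>

state, done, score = env.reset(), False, 0.0
while not done:
    q_values = model.predict(np.asarray(state)[None, ...], verbose=0)
    state, reward, done, info = env.step(int(np.argmax(q_values)))
    score += reward
print("Episode score:", score)
```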
This project was completed as part of an INT404 assignment; the final report can be found here.