Final Project for COMS 6998: Deep Learning Systems Performance at Columbia University
Collaborator: In Wai Cheong (https://www.github.com/InwaiCheong)
Reference: https://github.com/sweetice/Deep-reinforcement-learning-with-pytorch (we heavily modified the DQN files from this repo)
Our goal is to measure the sensitivity of Deep Q-Networks (DQNs) on different tasks to learning rate, batch size, optimizer choice, target Q-network update step size, discount factor, and other hyperparameters, in order to identify the relationship between hyperparameters and efficient convergence to the optimal policy across different state/action regimes.
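For concreteness, below is a minimal sketch of the kind of search space swept in this project. The parameter names and ranges here are illustrative assumptions, not the exact values used in the notebooks.

```python
import numpy as np

# Illustrative search space only -- the exact ranges live in the notebooks.
SEARCH_SPACE = {
    "learning_rate":       lambda: 10 ** np.random.uniform(-5, -2),  # log-uniform
    "batch_size":          lambda: int(np.random.choice([32, 64, 128, 256])),
    "optimizer":           lambda: str(np.random.choice(["adam", "rmsprop", "sgd"])),
    "target_update_steps": lambda: int(np.random.choice([100, 500, 1000, 5000])),
    "discount_factor":     lambda: float(np.random.uniform(0.9, 0.999)),
}

def sample_config():
    """Draw one random hyperparameter configuration from the space above."""
    return {name: draw() for name, draw in SEARCH_SPACE.items()}
```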
We compare three hyperparameter search strategies:

- Random Search (sketched below)
- Successive Halving
- Bayesian Optimization
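As an illustration of the simplest of the three, here is a hedged sketch of random search over the space above. `train_dqn(config, episodes)` is a hypothetical wrapper around the DQN training loop in the notebooks, assumed to return a scalar reward estimate for the trained agent; it is not a function from the repo.

```python
def random_search(train_dqn, n_trials=20, budget_episodes=500):
    """Random search: sample configurations i.i.d. and train each one for
    the same fixed budget, keeping the best by mean episodic reward."""
    best_config, best_reward = None, float("-inf")
    for _ in range(n_trials):
        config = sample_config()
        reward = train_dqn(config, episodes=budget_episodes)
        if reward > best_reward:
            best_config, best_reward = config, reward
    return best_config, best_reward
```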
See the report for in-depth details about our implementation.
The notebooks can be downloaded and run as is. There are two notebooks each for Successive Halving and Random Search: one contains the implementation and the other visualizes the agent. Bayesianopt.ipynb contains the Bayesian Optimization implementation.
- Even on simple games, DQN performance is enormously sensitive to hyperparameter choices.
- The sample complexity of deep RL is very high and the reward signal is very sparse, so parametric methods that model the objective from observed rewards (e.g. Bayesian Optimization) do not work well.
- Given the sparsity of the reward signal in our examples, pseudo-evolutionary methods such as Successive Halving actually work best (a minimal sketch follows below).
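To make the last point concrete, here is a minimal successive-halving sketch under the same assumptions as the snippets above (`train_dqn` and `sample_config` are hypothetical helpers, not functions from the repo). For simplicity it retrains survivors from scratch at each round; a real implementation would resume from checkpoints.

```python
def successive_halving(train_dqn, n_configs=16, min_episodes=100, eta=2):
    """Successive halving: train every configuration on a small budget,
    keep the top 1/eta by reward, multiply the budget by eta, and repeat
    until a single configuration survives. Pseudo-evolutionary in that
    weak configurations are culled early, so the sparse reward signal is
    spent on the promising ones."""
    configs = [sample_config() for _ in range(n_configs)]
    budget = min_episodes
    while len(configs) > 1:
        # Score every surviving configuration at the current budget.
        scored = [(train_dqn(c, episodes=budget), c) for c in configs]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        # Keep the top 1/eta and grow the budget for the next round.
        configs = [c for _, c in scored[: max(1, len(configs) // eta)]]
        budget *= eta
    return configs[0]
```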