Skip to content

Augmented Random Search (ARS) and Deep Q Learning (DQN) reinforcement learning techniques to teach a simulated bipedal robot to walk.

Notifications You must be signed in to change notification settings

jacobazoulay/bipedal-robot-rl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Teaching a Robot to Walk Using Reinforcement Learning

This project applies Deep Q-Learning (DQN) and Augmented Random Search (ARS) to teach a simulated two-dimensional bipedal robot how to walk using the OpenAI Gym BipedalWalker-v3 environment. Deep Q-learning did not yield a high reward policy, often prematurely converging to suboptimal local maxima likely due to the coarsely discretized action space. ARS, however, resulted in a better trained robot, and produced an optimal policy which officially "solves" the BipedalWalker-v3 problem. Various naive policies, including a random policy, a manually encoded inch forward policy, and a stay still policy, were used as benchmarks to evaluate the proficiency of the learning algorithm results.

This is the final project for AA228 Decision Making Under Uncertainty course at Stanford University.

The final paper can be found here.

Directory Structure

  • assets: contains images for documentation and stored dqn model parameters
  • data: contains results and performance metrics for various tested models.
  • src:
    • eval: contains model evaluation functions
    • models: contains various robot agent models
      • dqn: directory containing PyTorch and TensorForce versions of Deep Q-Learning models and training functions.
      • ARS.py: Augmented Random Search (ARS) model and training function.
      • predeterimined.py: manually hardcoded policy for baseline
      • random.py: random policy for baseline
  • utils: contains plotting util

Project Details

Rendered Environment

Robot rendered environment

DQN Architecture

DQN architecture

ARS Result

ARS reward

About

Augmented Random Search (ARS) and Deep Q Learning (DQN) reinforcement learning techniques to teach a simulated bipedal robot to walk.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published