Imitation Learning

Dependencies: TensorFlow, MuJoCo version 1.31, OpenAI Gym

Note: MuJoCo versions until 1.5 do not support NVMe disks therefore won't be compatible with recent Mac machines. There is a request for OpenAI to support it that can be followed here.

run_expert.ipynb, which is code to load up an expert policy, run a specified number of roll-outs, and save out data. 'run_clone.ipynb', which is code to use a neural network to imitate the expery policy 'run_dagger.ipynb', which is code to use the behavior cloning and dataset aggregation to better imitate the expery policy

In experts/, the provided expert policies are:

Ant-v1.pkl
HalfCheetah-v1.pkl
Hopper-v1.pkl
Humanoid-v1.pkl
Reacher-v1.pkl
Walker2d-v1.pkl

The name of the pickle file corresponds to the name of the gym environment.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
data		data
experts		experts
models		models
README.md		README.md
demo.bash		demo.bash
load_policy.py		load_policy.py
run_clone.ipynb		run_clone.ipynb
run_dagger.ipynb		run_dagger.ipynb
run_expert.ipynb		run_expert.ipynb
run_expert.py		run_expert.py
tf_util.py		tf_util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Imitation Learning

About

Releases

Packages

Languages

rudolfsteiner/DAgger

Folders and files

Latest commit

History

Repository files navigation

Imitation Learning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages