GitHub - AdithyaVenkateshMohan/MADDPG-Tennis-Mlagents

.idea
.ipynb_checkpoints
Tennis_Windows_x86_64
__pycache__
baselines
model_dir
multiagent
python
Lab_Questions
MultiAgent.ipynb
OUNoise.py
README
Tennis.ipynb
Tennis.py
Tennis_Windows_x86_64.zip
buffer.py
clean.sh
ddpg.py
env_wrapper.py
envs.py
maddpg.py
main.py
model.py
networkforall.py
notebook.tar.gz
run_tensorboard.sh
run_training.sh
unity-environment.log
utilities.py
workspace_utils.py


1. To run the code, use the command "./run_training.sh". The bash script cleans up and DELETES the output of previous runs. The script is necessary because an extra command is needed to make image rendering possible remotely. Training takes about two hours. If you run locally on your own computer, be sure to increase the number of parallel agents in main.py to the number of cores your computer has. A GPU does not help much with the computation.

2. To see a visualization of the results, run the script "./run_tensorboard.sh". A link will appear; open that link in your browser to see rewards over time and other statistics.

3. The trained models are stored in "model_dir" by default. You can also find .gif animations that show how the agents are performing! Each .gif file shows a grid of the separate parallel agents.

4. To understand the goal of the environment: the blue dots are the "good agents", and the red dot is an "adversary". The goal of every agent is to get near the green target. The blue agents know which target is green, but the red agent is color-blind and cannot tell the green target from the black one! The optimal solution is for the red agent to chase one of the blue agents, and for the blue agents to split up and go toward each of the targets.
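
For step 1, here is a minimal sketch of how you might pick the number of parallel agents from the core count on your machine. The function name `suggested_parallel_envs` and its `reserve` parameter are hypothetical and not part of this repository. The name of the actual setting in main.py is not given here, so copy the printed value into it by hand.

```python
import os


def suggested_parallel_envs(reserve: int = 0) -> int:
    """Return a parallel-agent count based on the CPU cores the OS reports.

    `reserve` (hypothetical) leaves that many cores free for other work.
    The result is never lower than 1.
    """
    cores = os.cpu_count() or 1  # os.cpu_count() may return None
    return max(1, cores - reserve)


if __name__ == "__main__":
    print(f"Suggested number of parallel agents: {suggested_parallel_envs()}")
```

Passing `reserve=1` keeps one core free, which may help if the machine feels unresponsive during training.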
