Least Sqaures Policy Iteration
- python3
- gym
- tensorflow=1.5 (I tried tensorflow 1.13, but it's trapped)
- numpy
python3 main.py
In main.py
, you can choose basis function option.
1.gaussian
2.deep_cartpole
3.dan(deep_action_network)_h1
4.dan(deep_action_network)_perd