LSPI (Least Squares Policy Iteration) with TF 1.5.0

LSPI?

Least Squares Policy Iteration
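For orientation, LSPI alternates between fitting a linear Q-function to a fixed batch of transitions (the LSTDQ step) and acting greedily with respect to that fit. The block below is a minimal NumPy sketch of that loop under stated assumptions, not the code in lspi.py: the names `lstdq` and `lspi`, the `(s, a, r, s_next, done)` sample layout, the feature map `phi`, and the small ridge term `reg` are all hypothetical and chosen only for illustration.

```python
import numpy as np

def lstdq(samples, phi, policy, n_features, gamma=0.99, reg=1e-3):
    """One LSTDQ solve: returns weights w with Q(s, a) ~= phi(s, a) @ w."""
    A = reg * np.eye(n_features)  # assumed ridge term, keeps A invertible
    b = np.zeros(n_features)
    for s, a, r, s_next, done in samples:
        f = phi(s, a)
        # Do not bootstrap from terminal states.
        f_next = np.zeros(n_features) if done else phi(s_next, policy(s_next))
        A += np.outer(f, f - gamma * f_next)
        b += f * r
    return np.linalg.solve(A, b)

def lspi(samples, phi, n_features, n_actions, gamma=0.99, max_iter=20, tol=1e-6):
    """Repeat LSTDQ with the greedy policy until the weights stop changing."""
    w = np.zeros(n_features)
    for _ in range(max_iter):
        # Greedy policy with respect to the current weights.
        def greedy(s, w=w):
            return int(np.argmax([phi(s, a) @ w for a in range(n_actions)]))
        w_new = lstdq(samples, phi, greedy, n_features, gamma)
        if np.linalg.norm(w_new - w) < tol:
            return w_new
        w = w_new
    return w
```

Because the same batch of samples is reused in every iteration, the loop needs no further interaction with the environment once the data has been collected.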

Dependency

language

python3

libraries

gym
tensorflow=1.5 (tensorflow 1.13 was also tried, but it failed to run)
numpy

Run

python3 main.py

In main.py, you can choose which basis function to use:

1. gaussian

2. deep_cartpole

3. dan(deep_action_network)_h1

4. dan(deep_action_network)_perd
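As an illustration of what a Gaussian basis for LSPI usually looks like, the sketch below builds radial basis features of the state and copies them into the block belonging to the chosen action. It is an assumption about the general technique, not a copy of rbf.py: the function name `gaussian_features`, the `centers` array, the bias term, and the `sigma` width are hypothetical.

```python
import numpy as np

def gaussian_features(state, action, centers, n_actions, sigma=1.0):
    """Gaussian RBF features of `state`, placed in the block for `action`."""
    state = np.asarray(state, dtype=float)
    # Squared Euclidean distance from the state to every RBF center.
    dists = np.sum((centers - state) ** 2, axis=1)
    # Constant bias feature followed by one Gaussian bump per center.
    rbf = np.concatenate(([1.0], np.exp(-dists / (2.0 * sigma ** 2))))
    # One copy of the state features per action; all other blocks stay zero.
    features = np.zeros(n_actions * rbf.size)
    features[action * rbf.size:(action + 1) * rbf.size] = rbf
    return features
```

With features laid out this way, each action gets its own slice of the weight vector, so a single linear model can represent a separate Q-value per action.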

Reference

Batch, Off-policy and Model-Free Apprenticeship Learning

