Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Using ppo_rnn.py with cartpole might be a bit misleading since cartpole is a fully observable environment. I made this wrapper, which masks elements from the observation vector to create partially observable environments. This could be applied to environments used with ppo_rnn to make them partially observable.
To use this wrapper, you would need to add the following line:
env = MaskedObservationWrapper(env,config = {'obs_idx':[0,2],'mu':0.0,'sigma':0.1})
The obs_idx list indicates which indices will be masked from the observation vector. A noise will also be added to the remaining elements of the observation vector to make the task harder.