-
Notifications
You must be signed in to change notification settings - Fork 800
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to get the vpg_1.py in examples to work with other environments such as Acrobot and monitor their performance? #89
Comments
It should be relatively easy if you follow gym's documentation. Also you can refer to gym_env.py which is a wrapper around gym environments. |
What should I change to above part when dealing with the following environment?
Currently I am getting the following error.
I am using CategoricalMLPPolicy. |
The documentation is very incomplete and the examples do not tell me anything related to what I want to do. Don't quite understand why it works for GaussianMLP and the original env but when those are changed, it doesnt work anymore. |
It is because CartPole uses discrete actions. Use a CategoricalMLPPolicy instead. |
I mentioned above I used CategoricalMLPPolicy but I still face the same problem |
Can you show your entire snippet of code? |
Hi @yxchng, I could not find gym environment To simply solve your problem:
|
This is my full code. The error still persists.
|
Hi, it seems like you are using the GaussianMLPPolicy, but you should be using CategoricalMLPPolicy. Another thing with discrete actions: methods like |
I edited the code according to your advice.
I follow your advice but now it gives me
|
I tried using force_reset=True but it doesnt work. |
Hi, Make sure you are using this commit of gym exactly: 93d554bdbb4b2d29ff1a685158dbde93b36e3801 Refer to https://github.com/openai/rllab/blob/master/environment.yml. Make sure you are using the latest rllab code. |
I also have the same error raise error.Error("Tried to reset environment which is not done. While the monitor is active for {}, you cannot call reset() unless the episode is over.".format(self.env_id)) |
@gyang1011 set force_reset to True. |
What I mean by monitor here is to use gym.wrappers.
The text was updated successfully, but these errors were encountered: