Necessity of a Custom optimizer for the Critic (A2C). #19

davinellulinvega · 2019-07-15T13:40:24Z

Hello Germain / Everyone,

I am currently trying to implement the A2C algorithm as part of a simulation for my PhD. Given that, I have very limited time to do so, your source code is a great help, since the algorithm and operations are clearly outlined and not hidden away as is the case for OpenAI baseline implementation.
Still after having a look at the code in critic.py, I was wondering why did you define a custom optimizer for the critic has well (it is clearly justified for the actor), when simply compiling the critic network and passing MSE as the loss seem to have the same effect? Is there something I am missing here?
Anyway, that was just a though nothing game changing. Thanks a lot for sharing those implementations.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Necessity of a Custom optimizer for the Critic (A2C). #19

Necessity of a Custom optimizer for the Critic (A2C). #19

davinellulinvega commented Jul 15, 2019

Necessity of a Custom optimizer for the Critic (A2C). #19

Necessity of a Custom optimizer for the Critic (A2C). #19

Comments

davinellulinvega commented Jul 15, 2019