How to understand the following implementation of loss in theano? #88

yxchng · 2017-02-26T06:35:02Z

I don't see how the stochasticity of the actions 1/T*... is implemented in the theano line above. Isn't log_ likelihood_sym only computing one distribution? and not T of them and taking the average.

yxchng · 2017-02-26T06:40:44Z

Also, I think the right expression should be this, meaning it should take the gradient of the loglikelihood and not average

Isn't it?

dementrock · 2017-02-26T22:08:37Z

I don't understand your question. Do you mean it should be 1/N * ... instead of 1/NT * ... ?

yxchng · 2017-02-27T02:15:25Z

I think so. Not sure

dementrock · 2017-02-27T07:40:22Z

Generally you want your loss / gradient updates to be scale invariant. Also it does not really matter if you use rmsprop / adam etc. since they automatically rescale your gradients.

Add a customized tensor scalar to tensorboard by using the custom_scalar plugin in tensorboard. Each line in the scalar corresponds to an element in the tensor. Wrap the tensorboard logging module into a new class `Summary` in file rllab/misc/tensor_summary.py. It supports both the simple value and tensor logging. It also saves the computation graph created by rllab. To record the tensor into tensorboard, use the `record_tensor` function in file rllab/misc/logger.py. Refer to: rll#39, rll#38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to understand the following implementation of loss in theano? #88

How to understand the following implementation of loss in theano? #88

yxchng commented Feb 26, 2017 •

edited

Loading

yxchng commented Feb 26, 2017 •

edited

Loading

dementrock commented Feb 26, 2017

yxchng commented Feb 27, 2017

dementrock commented Feb 27, 2017

How to understand the following implementation of loss in theano? #88

How to understand the following implementation of loss in theano? #88

Comments

yxchng commented Feb 26, 2017 • edited Loading

yxchng commented Feb 26, 2017 • edited Loading

dementrock commented Feb 26, 2017

yxchng commented Feb 27, 2017

dementrock commented Feb 27, 2017

yxchng commented Feb 26, 2017 •

edited

Loading

yxchng commented Feb 26, 2017 •

edited

Loading