
cross entropy is wrong #32

Closed
yjxaigithub opened this issue Aug 15, 2021 · 3 comments

Comments

@yjxaigithub

In https://github.com/tensorlayer/RLzoo/blob/master/rlzoo/common/distributions.py#L96, I think it should be modified to "return -tf.reduce_sum(x*self._logits, axis=1)" to return the cross entropy, because self._logits is already the logarithm of the probability.
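For illustration, a minimal sketch of the distinction the suggestion hinges on (the tensors and names here are hypothetical, not the repository's variables):

```python
import tensorflow as tf

# Hypothetical one-hot target and a score vector that may be either
# log-probabilities or raw (pre-softmax) logits.
x = tf.constant([[0.0, 1.0, 0.0]])
scores = tf.constant([[1.0, 2.0, 0.5]])

# If `scores` already holds log probabilities, the suggested one-liner suffices:
ce_from_logprobs = -tf.reduce_sum(x * scores, axis=1)

# If `scores` holds raw logits, a log-softmax is needed first:
ce_from_logits = -tf.reduce_sum(x * tf.nn.log_softmax(scores, axis=1), axis=1)
# which matches TensorFlow's built-in helper:
ce_builtin = tf.nn.softmax_cross_entropy_with_logits(labels=x, logits=scores)
```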

@quantumiracle
Member

I'm not sure why you said self._logits is already the logarithm of the probability. In the standard literature it refers to the logits before the softmax operation; check the usage here.

@yjxaigithub
Author

yjxaigithub commented Aug 20, 2021

The kl function and the entropy function imply that self._logits is already the logarithm of the probability. If not, the calculations of the KL divergence and the entropy are wrong.
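For reference, the standard categorical formulas this comment appeals to, as a minimal sketch assuming `logp` and `logq` are genuine log-probability vectors (hypothetical names, not the repository's variables):

```python
import tensorflow as tf

# Hypothetical log-probability vectors of two categorical distributions.
logp = tf.math.log(tf.constant([[0.2, 0.5, 0.3]]))
logq = tf.math.log(tf.constant([[0.3, 0.4, 0.3]]))
p = tf.exp(logp)

kl = tf.reduce_sum(p * (logp - logq), axis=1)   # KL(p || q)
entropy = -tf.reduce_sum(p * logp, axis=1)      # H(p)
```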

@quantumiracle
Member

OK, I see what confused you. This issue can be resolved together with #31. In our implementation, self._logits holds the raw logits, not the log probabilities, so the KL and entropy functions are not wrong in this sense, and they should not follow what you suggested in #31. We intentionally compute the KL divergence and entropy with logits as inputs rather than standard log probabilities, so the code does not follow the standard math formulas you cited in #31. Both calculations contain a softmax explicitly, which is why you see the exp and sum terms in our (admittedly complex) implementation.
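To illustrate the point, here is a sketch (not the repository's exact code) of computing the entropy directly from raw logits, where the softmax shows up as explicit exp and sum terms:

```python
import tensorflow as tf

logits = tf.constant([[1.0, 2.0, 0.5]])

# Softmax written out with explicit exp/sum, shifted for numerical stability.
a = logits - tf.reduce_max(logits, axis=1, keepdims=True)
ea = tf.exp(a)
z = tf.reduce_sum(ea, axis=1, keepdims=True)
p = ea / z                                   # softmax(logits)

# H(p) = -sum p * log p, with log p = a - log z.
entropy = tf.reduce_sum(p * (tf.math.log(z) - a), axis=1)

# Same value computed from probabilities directly, as a check.
probs = tf.nn.softmax(logits, axis=1)
entropy_check = -tf.reduce_sum(probs * tf.math.log(probs), axis=1)
```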

I will close both issues. If you find other problems, feel free to report them!
