
Pi output in tensorflow is incorrect: missing softmax layer. #214

Open
josey4869 opened this issue Sep 7, 2020 · 4 comments

Comments


josey4869 commented Sep 7, 2020

In othello.tensorflow.OthelloNNet.py:

The pi output of the model is

self.pi = Dense(s_fc2, self.action_size) (line 36)

which is incorrect. In fact, it should be

self.pi = tf.nn.softmax(Dense(s_fc2, self.action_size)).

By contrast, the implementation in Keras is correct. In othello.keras.OthelloNNet.py:

self.pi = Dense(self.action_size, activation='softmax', name='pi')(s_fc2) (line 28).

Pull request: #215 (comment)

@goshawk22
Contributor

The softmax is applied in the line after, and the layer is assigned to the variable self.prob, which is then used instead of self.pi in the NNet file.
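For context, the pattern being described here can be sketched in plain NumPy (a hedged sketch, not the repo's exact code, with `softmax` standing in for `tf.nn.softmax`): `pi` stays as raw logits, and the probabilities are derived from it separately.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax along the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

# pi plays the role of self.pi: raw, unnormalized scores (logits),
# one score per action.
pi = np.array([[2.0, 1.0, 0.1]])

# prob plays the role of self.prob: the softmax of pi, used at
# inference time to pick moves.
prob = softmax(pi)

print(prob)  # each row of prob sums to ~1.0
```

Under this pattern, the loss consumes `pi` (the logits) while move selection consumes `prob`, so no softmax layer is attached to `pi` itself.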

@josey4869
Author

> The softmax is applied in the line after, and the layer is assigned to the variable self.prob, which is then used instead of self.pi in the NNet file.

Thanks for the reply! According to line 48 & line 131:

self.loss_pi = tf.losses.softmax_cross_entropy(self.target_pis, self.pi),

It seems self.pi, rather than self.prob, is used for computing pi_loss? So I guess we should either apply a softmax layer before outputting self.pi, or feed self.prob instead of self.pi into pi_loss?

@goshawk22
Contributor

Yes, I think it might make more sense to add the softmax layer to self.pi, to avoid confusion.

@rlronan
Contributor

rlronan commented Jun 9, 2021

Softmax cross-entropy expects logits in TensorFlow, so softmax should not be applied to pi directly.
https://www.tensorflow.org/api_docs/python/tf/compat/v1/losses/softmax_cross_entropy
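To see why, here is a NumPy sketch of the behavior described in the linked docs (the loss applies softmax to its input internally before taking cross-entropy): passing already-softmaxed probabilities applies softmax twice and yields a different loss than passing the raw logits.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax along the last axis.
    z = x - x.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def softmax_cross_entropy(labels, logits):
    # Mirrors what tf.losses.softmax_cross_entropy does internally:
    # softmax the input, then cross-entropy against the target distribution.
    return -(labels * np.log(softmax(logits))).sum(axis=-1).mean()

labels = np.array([[0.0, 1.0, 0.0]])   # target policy (one-hot here)
logits = np.array([[2.0, 1.0, 0.1]])   # raw network output

correct = softmax_cross_entropy(labels, logits)           # pass logits
double  = softmax_cross_entropy(labels, softmax(logits))  # softmax applied twice

print(correct, double)  # the two losses differ
```

So with `tf.losses.softmax_cross_entropy` in the loss, `self.pi` must remain logits, and the softmaxed `self.prob` is only for inference.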
