New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Some questions about "train_baseline.py" #185

Open

Aaricis opened this issue Sep 21, 2020 · 0 comments

Aaricis commented Sep 21, 2020

I run train_baseline.py, and after some iterations, I got information like this:

The policy_reward_mean always equals 0. I do not know whether this result is correct.

The text was updated successfully, but these errors were encountered:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment