Policy Gradient REINFORCE algorithm not converging. #26

padmaja-kulkarni · 2020-05-16T21:58:13Z

First of all, thank you for the tutorial here!

I am trying to implement/run the code from the tutorial, but the results do not converge after 500 steps as shown in the image "Reward: Training progress of Policy Gradient RL in Cartpole environment". Even after 5000 steps, the reward is still around 10. Is this expected?

Thanks again!

asokraju mentioned this issue Sep 10, 2020

Bugfix policy gradinet reinforce tf2 #29

Open
