Skip to content
This repository has been archived by the owner on Jul 21, 2020. It is now read-only.

Files

Latest commit

4d5e6a6 · May 17, 2020

History

History
This branch is 112 commits behind yandexdataschool/Practical_RL:master.

week06_policy_based

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Jan 24, 2020
May 1, 2020
Apr 10, 2020
Jan 24, 2020
Apr 12, 2020
May 5, 2020
May 17, 2020
Jan 24, 2020

Materials

More materials

  • Actually proving the policy gradient for discounted rewards - article

  • On variance of policy gradient and optimal baselines: article, another article

  • Learn Advatangeg Actor Critic with a comic

  • Generalizing log-derivative trick - url

  • Combining policy gradient and q-learning - arxiv

  • Variational perspective on reinforcement learning (from DeepBayes) - pdf

  • Adversarial review of policy gradient - blog

Run seminar notebook in colab: Open In Colab

Run optional homework notebook in colab: Open In Colab