GitHub - ChinmayMundane/RL_Basics: My implementations and notes on RL (Feb24-July24 self study)

ChinmayMundane / RL_Basics Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

My implementations and notes on RL (Feb24-July24 self study)

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Pytorch yt		Pytorch yt
pytorch notes		pytorch notes
set_1		set_1
1_5_contextual_bandits.ipynb		1_5_contextual_bandits.ipynb
1_Two_armed_Bandit.ipynb		1_Two_armed_Bandit.ipynb
README.md		README.md
RL_01-altered.ipynb		RL_01-altered.ipynb
RL_01.ipynb		RL_01.ipynb
my_RL_notes.pdf		my_RL_notes.pdf
reinforcement_q_learning.ipynb		reinforcement_q_learning.ipynb

Repository files navigation

Structure

This repo includes

Pytorch notes and its implementations
Classic RL algorithms
notes I made going through the resources.
training and testing a RL agent in multiple environment in metadrive simulator(open source autonomous driving simulator)

Execution

You can run the test codes normally on google collab or jupyter notebook

For RL training and scripts
install metadrive
move the "set_1 folder under metadrive/examples"
cd under the metadrive and run

python -m metadrive.examples.file_name

(Remember to comment out some part of codes to customise the settings)

Resources

A tutorial on MADDPG(not imp right now) - https://medium.com/machine-intelligence-and-deep-learning-lab/a-tutorial-on-maddpg-53241ae8aac
Davild silver playlist - https://www.davidsilver.uk/teaching/
Policy based, on/off policy, model based/free - https://stats.stackexchange.com/questions/407230/what-is-the-difference-between-policy-based-on-policy-value-based-off-policy
Q learning - https://www.avenga.com/magazine/q-learning-applications/#:~:text=The%20optimal%20value%20function%20
Deep RL bootcamp - https://www.youtube.com/watch?v=qaMdN6LS9rA
Policy and value iteration - https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
Policy iteration : https://towardsdatascience.com/policy-iteration-in-rl-an-illustration-6d58bdcb87a7
PPO - https://huggingface.co/blog/deep-rl-ppo
Theory + implementation reference - https://huggingface.co/learn/deep-rl-course/unit8/introduction

About

My implementations and notes on RL (Feb24-July24 self study)

Report repository

Releases

No releases published

Packages

No packages published

Languages