This project contains my work on the Multi-Armed Bandit (MAB) problem and variations and extensions thereof, carried out as part of an Independent Study in Computer Science during university. The MAB problem is encountered in probability theory, statistics, and machine learning (specifically, Reinforcement Learning). It is widely regarded as an improvement over the standard A/B Testing approach because the model learns dynamically, incorporating outcomes and feedback as they arrive. A central concept within MABs is the tradeoff between exploration (learning about options) and exploitation (maximizing the current best option).
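To make the exploration–exploitation tradeoff concrete, here is a minimal sketch of the classic epsilon-greedy strategy on simulated Bernoulli-reward arms. This is an illustrative example, not code from this repository; the arm probabilities and parameter values are assumptions chosen for demonstration.

```python
import random

def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=10000, seed=0):
    """Simulate an epsilon-greedy agent on Bernoulli arms.

    With probability epsilon the agent explores (picks a random arm);
    otherwise it exploits (picks the arm with the highest estimated mean).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # number of pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: random arm
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit: best estimate
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # incremental update of the running mean for the pulled arm
        values[arm] += (reward - values[arm]) / counts[arm]
        total_reward += reward
    return counts, values, total_reward

# Hypothetical three-armed bandit: arm 2 (mean 0.8) is the best choice.
counts, values, total = epsilon_greedy_bandit([0.2, 0.5, 0.8])
best_arm = counts.index(max(counts))
```

With enough steps, the agent concentrates its pulls on the highest-mean arm while the epsilon fraction of random pulls keeps refining the estimates for the others.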
krishnarupa1008/Multi-Armed-Bandits-Combinatorial-Multi-Armed-Bandits