Name		Name	Last commit message	Last commit date
parent directory ..
README.rst		README.rst
__init__.py		__init__.py
hyperparams.yml		hyperparams.yml
model_setup.py		model_setup.py
util.py		util.py

README.rst

The Multi-armed Bandit

A simple example of how to build a policy-gradient based agent that can solve the multi-armed bandit problem.