GitHub - martinarjovsky/Reversi-AI: A Reinforcement Learning based agent that plays Reversi

This is the implementation of a Reinforcement Learning agent I developed that learns how to play reversi. The algorithm it's based on is called UCT (Upper Confidence bounds applied to Trees) which is a variation from traditional Monte Carlo Tree Search algorithms, taking advantage of the UCB algorithms for solving the K-armed bandit problem. To run the agent call the function uct in matlab or octave.

If you have any questions regarding this implementation or the agent feel free to contact me at [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
InfoRL.pdf		InfoRL.pdf
README.md		README.md
dec_tree.m		dec_tree.m
doAction.m		doAction.m
findValidMoves.m		findValidMoves.m
heuristic11.m		heuristic11.m
heuristic12.m		heuristic12.m
heuristic21.m		heuristic21.m
heuristic22.m		heuristic22.m
init_state.m		init_state.m
playGames.m		playGames.m
plotq.m		plotq.m
toAct.m		toAct.m
uct.m		uct.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

martinarjovsky/Reversi-AI

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages