Skip to content

A Python implementation of the UC-DTR algorithm of Zhang and Bareinboim

License

Notifications You must be signed in to change notification settings

shlomenu/uc_dtr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

UC-DTR

The UC-DTR algorithm exploits causal bounds to provide provably tighter estimates of the value of policies in a dynamic treatment regime (DTR), a kind of non-Markovian decision process studied in medical literature on multi-stage clinical interventions. I investigated this algorithm due to an interest in whether it could be used for machine translation in low-resource settings involving artificial languages with unidirectional semantics. In the process, I reimplemented the UC-DTR algorithm presented in UC-DTR paper (Zhang & Bareinboim, 2019). I am grateful to the original authors, who were kind to share their original implementation with me privately.

Install the requirements specified by requirements.txt and run the standalone script with python uc_dtr.py [--help]. The default parameters allow the script to run at a tolerable speed on my local machine. Upon completion, a plot of the rolling average regret resulting from random exploration and UC-DTR is saved to uc_dtr_simulation-TIMESTAMP.png.

About

A Python implementation of the UC-DTR algorithm of Zhang and Bareinboim

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages