DecPOMDPGridworld

This directory contains problem examples meant for the MADP Toolbox.

Running Solver

Command to run solver without incremental expansion: ../MADP/src/solvers/GMAA --sparse --GMAA=MAAstar .dpomdp -h4

Command to run solver with incremental expansion: ../MADP/src/solvers/GMAA --sparse --GMAA=MAAstar --BGIP_Solver=BnB --BnB-ordering=Prob .dpomdp -h4

Example 1: 23gwsimple

This is a basic 2x3 gridworld example. Two agents are moving around a reward map (shown in the image). They both must make the same action for it to take effect. (e.g. to move right they both must choose "right" as their action.) They start at the top right corner and must solve to navigate.

Solution Policy

Example 2: 23gw-rmap

This is similar to Example 1 except now there are two reward maps. The agents are able to observe what reward map they are on and navigate accordingly.

Solution Policy

Example 3: 23gw-comm

This expands on the previous examples. Now, for actions, they can either move or move AND communicate. Communication comes at a small cost. When they communicate, they are able to observe what reward map they are on.

Solution Policy

Example 4: 23gw-nocomm

This is the same as the previous example, except the reward maps are different.

Note that, regardless of which map the agents are on, there is always a path that doesn't hit an obstacle. This makes communication unnecessary. This is shown in the policy.

Example 5: 23gw-machine knows

In this example, we are back to the original two reward maps. The human is the one deciding movement in the environment and the machine is the one deciding whether or not to communicate. The machine knows everything about the environment. You can see that the machine decides to communicate and therefore the human traverses the obstacles correctly.

Example 6: 23gw-sharedctrl

In this example, the machine knows everything and has the option to communicate or take control. The cost to communicate is -1 and the cost to take control is -5. The machine chooses to communicate and not take control.

Example 7: 23gw-sharedctrl2

In this example, it's the same as before but the cost to communicate is -5 and the cost to take control is -1. In this one, the machine chooses to take control.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
results		results
scripts		scripts
.gitignore		.gitignore
23gw-comm.dpomdp		23gw-comm.dpomdp
23gw-machknows.dpomdp		23gw-machknows.dpomdp
23gw-nocomm.dpomdp		23gw-nocomm.dpomdp
23gw-rmaps.dpomdp		23gw-rmaps.dpomdp
23gw-sharedctrl.dpomdp		23gw-sharedctrl.dpomdp
23gw-sharedctrl2.dpomdp		23gw-sharedctrl2.dpomdp
23gwsimple.dpomdp		23gwsimple.dpomdp
33gw-late.dpomdp		33gw-late.dpomdp
33gw-nocomm.dpomdp		33gw-nocomm.dpomdp
33gw-sharedcontrol-prob.dpomdp		33gw-sharedcontrol-prob.dpomdp
33gw-sharedcontrol.dpomdp		33gw-sharedcontrol.dpomdp
33gw.dpomdp		33gw.dpomdp
AnalyzeResults.py		AnalyzeResults.py
Ex1Rmap.png		Ex1Rmap.png
Ex2Rmap.png		Ex2Rmap.png
ExRmap3.png		ExRmap3.png
GW-gen.py		GW-gen.py
GWwriter.py		GWwriter.py
Policy.png		Policy.png
README.md		README.md
Rmap-sol.png		Rmap-sol.png
comm-sol.png		comm-sol.png
human-in-control.png		human-in-control.png
humanpol.png		humanpol.png
machine-communicate.png		machine-communicate.png
machine-take-control.png		machine-take-control.png
machinepol.png		machinepol.png
no-comm-example.png		no-comm-example.png
no-comm-rmaps.png		no-comm-rmaps.png
nonoptout.log		nonoptout.log
optoutputs.log		optoutputs.log
simple-sol.png		simple-sol.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DecPOMDPGridworld

Running Solver

Example 1: 23gwsimple

Solution Policy

Example 2: 23gw-rmap

Solution Policy

Example 3: 23gw-comm

Solution Policy

Example 4: 23gw-nocomm

Example 5: 23gw-machine knows

Example 6: 23gw-sharedctrl

Example 7: 23gw-sharedctrl2

About

Releases

Packages

Languages

AlyssaLytle/DecPOMDPGridworld

Folders and files

Latest commit

History

Repository files navigation

DecPOMDPGridworld

Running Solver

Example 1: 23gwsimple

Solution Policy

Example 2: 23gw-rmap

Solution Policy

Example 3: 23gw-comm

Solution Policy

Example 4: 23gw-nocomm

Example 5: 23gw-machine knows

Example 6: 23gw-sharedctrl

Example 7: 23gw-sharedctrl2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages