Dynamic-horizon Model-based Value Expansion (DMVE)

This repository implements DMVE, the main algorithm described in "Dynamic-horizon Model-based Value Estimation with Latent Imagination" (IEEE Transactions on Neural Networks and Learning Systems, 2022).

Method

DMVE dynamically adjusts the rollout horizon according to the value expansion errors between the original and reconstructed images. The framework consists of two parts. The first part performs value expansion with latent imagination over a fixed rollout length. The second part builds on the first and adjusts the rollout horizon dynamically according to its reliability, which is measured by the difference between the value expansion computed from the reconstructed images and the value expansion computed from the raw images.
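The idea of picking a horizon by comparing value expansions can be sketched as follows. This is a minimal illustration and not the code in dmve.py: the function names `value_expansions` and `select_horizon`, the plain-list inputs, and the rule of choosing the single horizon with the smallest discrepancy are all assumptions made for the example.

```python
def value_expansions(rewards, values, gamma=0.99):
    """Return the h-step value expansion for every horizon h = 1..H.

    rewards[t] is the imagined reward at step t (t = 0..H-1) and
    values[t] is the value estimate of the state reached after step t.
    The h-step estimate is: sum_{t<h} gamma^t * r_t + gamma^h * v_h.
    """
    estimates = []
    discounted_return = 0.0
    for h, (reward, value) in enumerate(zip(rewards, values), start=1):
        discounted_return += gamma ** (h - 1) * reward
        estimates.append(discounted_return + gamma ** h * value)
    return estimates


def select_horizon(raw_estimates, recon_estimates):
    """Pick the horizon whose two estimates agree best (illustrative rule).

    raw_estimates and recon_estimates hold one value expansion per horizon,
    computed from the raw and the reconstructed images respectively.
    Returns the 1-based horizon and the per-horizon absolute errors.
    """
    errors = [abs(a - b) for a, b in zip(raw_estimates, recon_estimates)]
    best_index = min(range(len(errors)), key=errors.__getitem__)
    return best_index + 1, errors
```

Here a small discrepancy is treated as a sign that the rollout is still reliable at that horizon; how the repository actually turns the errors into a horizon choice should be checked against dmve.py and the paper.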

Instructions

Get dependencies:

Install MuJoCo 2.1.0 at ~/.mujoco/mujoco210 and copy your license key to ~/.mujoco/mjkey.txt.

Install the required packages with:

conda create -n dmve python=3.7
conda activate dmve
pip install -r requirements.txt

Train the agent:

python dmve.py --logdir ./logdir/dmc_walker_run/dmve/1 --task dmc_walker_run
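To train several runs of the same task, the command above can be generated per run. The sketch below only builds the command lines; it assumes that the trailing `1` in the log directory is a run index, which the README does not state, and the helper name `build_train_command` is hypothetical. Only the `--logdir` and `--task` flags shown above are used.

```python
import sys


def build_train_command(task, run, logdir_root="./logdir", method="dmve"):
    """Build the dmve.py training command for one run of a task.

    Assumption: the last path component of --logdir is a run index,
    mirroring ./logdir/dmc_walker_run/dmve/1 in the README.
    """
    logdir = "{}/{}/{}/{}".format(logdir_root, task, method, run)
    return [sys.executable, "dmve.py", "--logdir", logdir, "--task", task]


if __name__ == "__main__":
    # Print one command per run; pass each list to subprocess.run to launch it.
    for run in (1, 2, 3):
        print(" ".join(build_train_command("dmc_walker_run", run)))
```

Keeping each run in its own log directory means the plotting and TensorBoard commands, which read from ./logdir, see the runs as separate entries.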

Generate plots:

python plotting.py --indir ./logdir --outdir ./plots --xaxis step --yaxis test/return --bins 3e4

Graphs and GIFs:

tensorboard --logdir ./logdir

Citation

@article{wang2022dynamic,
  title={Dynamic-horizon model-based value estimation with latent imagination},
  author={Wang, Junjie and Zhang, Qichao and Zhao, Dongbin},
  journal={IEEE Transactions on Neural Networks and Learning Systems},
  year={2022},
  publisher={IEEE},
  doi={10.1109/TNNLS.2022.3215788}
}
