code for "Dynamic-horizon Model-based Value Estimation with Latent Imagination"

JunjieWang95/dmve

Dynamic-horizon Model-based Value Expansion (DMVE)

This repository implements the main algorithm of DMVE described in "Dynamic-horizon Model-based Value Estimation with Latent Imagination" (IEEE Transactions on Neural Networks and Learning Systems).

Method

DMVE

Overview of Dynamic-horizon Model-based Value Expansion (DMVE). DMVE dynamically adjusts the rollout horizon according to the value expansion errors between the original and reconstructed images. The framework consists of two parts. The first part performs value expansion over a fixed rollout length using latent imagination. The second part builds on the first and adjusts the rollout horizon according to its reliability, which is measured by the difference between the value expansions computed from the reconstructed images and from the raw images.
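To make the horizon-selection idea above concrete, here is a minimal sketch in plain Python. It is not taken from `dmve.py`: the function names, the list-based inputs, and the rule of picking the single horizon with the smallest error are all assumptions made for illustration, and the paper's actual selection rule may differ. The sketch computes an h-step value expansion along two imagined rollouts (one started from the raw image, one from the reconstructed image) and returns the horizon where the two estimates agree best.

```python
from typing import Sequence


def value_expansion(rewards: Sequence[float], values: Sequence[float],
                    horizon: int, gamma: float = 0.99) -> float:
    """h-step value expansion: discounted rewards plus a bootstrapped value.

    rewards[t] is the predicted reward at imagined step t, and values[t] is
    the predicted value of the state reached after t imagined steps.
    (Hypothetical helper, not part of the repository.)
    """
    discounted = sum(gamma ** t * rewards[t] for t in range(horizon))
    return discounted + gamma ** horizon * values[horizon]


def select_horizon(raw_rewards: Sequence[float], raw_values: Sequence[float],
                   rec_rewards: Sequence[float], rec_values: Sequence[float],
                   max_horizon: int, gamma: float = 0.99) -> int:
    """Return the horizon in 1..max_horizon with the smallest expansion error.

    The error is the absolute difference between the value expansion of the
    rollout started from the raw image and the one started from the
    reconstructed image. Choosing the argmin is an assumption of this sketch.
    """
    errors = []
    for h in range(1, max_horizon + 1):
        v_raw = value_expansion(raw_rewards, raw_values, h, gamma)
        v_rec = value_expansion(rec_rewards, rec_values, h, gamma)
        errors.append(abs(v_raw - v_rec))
    return 1 + errors.index(min(errors))


if __name__ == "__main__":
    # Toy numbers: the two rollouts agree exactly at horizon 2.
    horizon = select_horizon(
        raw_rewards=[1, 1, 1], raw_values=[0, 1, 1, 1],
        rec_rewards=[1, 1, 1], rec_values=[0, 2, 1, 5],
        max_horizon=3, gamma=0.5,
    )
    print("selected horizon:", horizon)
```

In the toy example the errors for horizons 1, 2, and 3 are 0.5, 0, and 0.5, so horizon 2 is selected. In a real agent the rewards and values would come from the learned latent model and value network rather than from hand-written lists.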

Instructions

Get dependencies:

Install MuJoCo 2.1.0 at ~/.mujoco/mujoco210. If your setup still expects a license key, copy it to ~/.mujoco/mjkey.txt.

Install required packages with

conda create -n dmve python=3.7
conda activate dmve
pip install -r requirements.txt

Train the agent:

python dmve.py --logdir ./logdir/dmc_walker_run/dmve/1 --task dmc_walker_run

Generate plots:

python plotting.py --indir ./logdir --outdir ./plots --xaxis step --yaxis test/return --bins 3e4

Graphs and GIFs:

tensorboard --logdir ./logdir

Citation

@article{wang2022dynamic,
  title={Dynamic-horizon model-based value estimation with latent imagination},
  author={Wang, Junjie and Zhang, Qichao and Zhao, Dongbin},
  journal={IEEE Transactions on Neural Networks and Learning Systems},
  year={2022},
  publisher={IEEE},
  doi={10.1109/TNNLS.2022.3215788}
}
