
Evaluations of PPO with/without Stein control variate

This directory contains the evaluation code for the Stein control variate. It evaluates the variance reduction methods introduced in the paper.
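For readers new to control variates, the following toy sketch shows what "variance reduction" is measuring. It is not the Stein control variate from the paper and does not use any code from this repository: it applies a plain scalar control variate to a one-dimensional Monte Carlo estimate, and the function names (`paired_estimates`, `estimator_variances`) are hypothetical.

```python
import math
import random
import statistics


def paired_estimates(n_samples, rng):
    """Return (plain, control-variate) estimates of E[exp(X)] for X ~ N(0, 1)."""
    xs = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    fs = [math.exp(x) for x in xs]
    mean_x = statistics.fmean(xs)
    mean_f = statistics.fmean(fs)
    # Control variate g(x) = x, whose true mean (0) is known in closed form.
    cov_fx = sum((f - mean_f) * (x - mean_x) for f, x in zip(fs, xs)) / (n_samples - 1)
    coeff = cov_fx / statistics.variance(xs)
    # Subtract coeff * (g(x) - E[g]) from every sample; E[g] = 0 here.
    corrected = statistics.fmean([f - coeff * x for f, x in zip(fs, xs)])
    return mean_f, corrected


def estimator_variances(n_repeats=500, n_samples=100, seed=0):
    """Empirical variance of both estimators across repeated, independent batches."""
    rng = random.Random(seed)
    pairs = [paired_estimates(n_samples, rng) for _ in range(n_repeats)]
    plain_var = statistics.variance([plain for plain, _ in pairs])
    cv_var = statistics.variance([corrected for _, corrected in pairs])
    return plain_var, cv_var


if __name__ == "__main__":
    plain_var, cv_var = estimator_variances()
    print(f"variance without control variate: {plain_var:.5f}")
    print(f"variance with control variate:    {cv_var:.5f}")
```

Both estimators target the same expectation; the control variate only changes how much the estimate fluctuates from batch to batch, which is the quantity this evaluation compares across methods.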

Running Examples

Take the Walker2d-v1 environment as an example.

Train and generate evaluation data:

# Evaluate the policy with and without Stein control variates
bash walker2d_train_eval.sh

Different max-timesteps values lead to different scales of variance. NB: max-timesteps can be set with the -m option; a larger max-timesteps value leads to a larger batch size, which needs more iterations to fit.
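The link between batch size and the scale of variance can be illustrated with a toy sketch. It assumes that max-timesteps bounds the number of timesteps collected per batch, as the note about batch size suggests, and it uses a hypothetical one-step problem (actions drawn from N(theta, 1), reward equal to the action) rather than anything from this repository.

```python
import random
import statistics


def batch_gradient(batch_size, rng, theta=0.0):
    """Batch-averaged score-function gradient estimate for the toy problem."""
    samples = []
    for _ in range(batch_size):
        action = rng.gauss(theta, 1.0)
        reward = action
        # d/dtheta log N(action; theta, 1) = action - theta
        samples.append((action - theta) * reward)
    return statistics.fmean(samples)


def gradient_variance(batch_size, n_repeats=300, seed=0):
    """Empirical variance of the batch gradient across independent batches."""
    rng = random.Random(seed)
    grads = [batch_gradient(batch_size, rng) for _ in range(n_repeats)]
    return statistics.variance(grads)


if __name__ == "__main__":
    for batch_size in (100, 1000):
        print(batch_size, gradient_variance(batch_size))
```

In this toy setting the variance of the averaged estimate shrinks roughly in proportion to the batch size, so variance figures obtained with different -m values are not directly comparable.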

Plot the variance obtained with the different Phi function optimization methods:

# plot variance figure
python traj_visualize.py
