Reason8.ai code for 3th place NIPS learning to run challenge.
We are porting this code to pytorch here
-
Get opensim package. You can use default package as described here or build by youself faster version here
-
Run setup script:
$ ./setup.sh
You may want to change conda env name in script and comment last line if not building opnesim by yourself
-
If you experience theano errors try to move .theanorc file:
$ mv ~/.theanorc ~/.theanorc.backup
-
Activate environment:
$ source activate nips_rl_fast3
-
Run code with best flags:
$ python run_experiment.py --param_noise_prob 0.3 --flip_prob 1 --layer_norm
The final submitted model was trained in this commit.
There are lot of branches with various ideas tested during competition but without documentation, you could check for example following branches:
- distributed ddpg with pyro4 inspired by ctmarko repository
- distributed CEM with pyro4 I am not sure that this is canonical implementation, it was done in the last night.
- we even tried to do planning as described in this article