-
Notifications
You must be signed in to change notification settings - Fork 724
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Weird tensorboard plots in GAIL #577
Comments
But then I can't use |
You have to wrap each individual environment with |
@Miffyli Could you elaborate a bit more on what to do? I need to have multiple processes, each with a separate normalized env, but I get the following error:
Here is the error trace:
|
Please read the documentation, we have different examples showing the use of a Monitor wrapper.
Why don't you use a |
Using |
Took me a while to realize: If you look carefully at the def create_env():
actenv = gym.make(args.env_id)
os.makedirs(osp.join(args.tb_log, 'ppo2'), exist_ok=True)
return Monitor(actenv, osp.join(args.tb_log, 'ppo2', '{}'.format(i), ), allow_early_resets=True)
allenvs = []
for i in range(args.num_procs):
allenvs.append(create_env)
env = SubprocVecEnv(allenvs, ) Note to self: This should probably be clarified in docs as it can be easily lost among all the lambdaing and whatnot. |
I agree and disagree at the same time. In the doc, we only use EDIT: the type in the doc is wrong btw |
@Miffyli Thanks, this seems to work. Now I realize that my problem was very subtle, and cloudpickle can pickle the function, but not the premade environment that I passed into it. |
Describe the bug
While training GAIL, I get plots which go back in the x-axis in tensorboard. Please see example.
Code example
Here is the graph that I get
System Info
Describe the characteristic of your environment:
This problem doesn't occur with TRPO where I get reasonable plots.
The text was updated successfully, but these errors were encountered: