-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The X server sometimes work but sometimes stop. #22
Comments
Hi @JisuHann , Can you try reducing the number of processes used during training (i.e. change this line) and see if this allows training to proceed? A newer version of this codebase can be found at https://github.com/allenai/disturb-free which was a follow-up work. The version of |
Thank you for quick response, @Lucaweihs ! First of all, I tried reducing the number of processes but also stopping issue happened again. To debug this, I captured one phenomenon on this issue. While each episode, I've tried to print where does the stopping point happens. It turns out it happened on the Second, I've tried the disturb-free repository that you recommended, and the same experiment happens again (even with less number of processes). By the way, I've tried to another machine with 4 GeForce RTX 3090, but it does not work as well. I would attach the details of my machine.
|
Hi. Thanks for your work!
I followed your instructions and checked sometimes the X server works, and sometimes it doesn't. To be specific, the screen just stops and don't move. This happens right after the initialization stage. The final code of the screen is shown as follows:
After this there is no console information that is given, and I confirmed that the entire system is not working. Do you know when does this happen and how can I solve this?
When I terminated the process the error is given as follows:
Lastly, I want to ask is there any upgrade plans for this framework. Compared to the allenact repository, this framework may be seen as quite outdated (e.g., the ai2thor version is 0.0.1, but the current version is 5.0.0). I'd really appreciate if this is taken into consideration.
Thank you.
The text was updated successfully, but these errors were encountered: