-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
running modeling.py in a cluster environment #2
Comments
Hi Hee Jong, To run parallel-processing replica exchange, IMP must be compiled using MPI. For example, One can then use
which will perform a single modeling run with four replicas. Running multiple modeling runs on a cluster requires setting up a script specific for that cluster software and architecture. Once you have successfully been able to run a single parallel replica exchange simulation using the command above, you should be able to use that line in your cluster submission script. |
This isn't a great idea because it will result in all of IMP being compiled with MPI. Only the |
Thanks for both. @saltzberg I already compiled the imp with the cluster's mpicxx and all that and put it in the module. For example, from previous RNA pol II tutorial, I made the following SLRUM script to submit the modeling job.
it would be awesome if you can help me convert the for loop in bash to mpirun command. @benmwebb Would it cause any serious issues if I set CXX_COMPILER to mpicxx? Due to the cluster environment complexity, I preferred to be explicit so I set that up and compiled. If i don't, sometimes it's a little bit difficult to keep track of which compiler/libraries were used for this specific instance. |
So, I just changed global_output_directory="output" in ReplicaExchange0 and num_frames to fixed value instead of sys.argv to take number from command line and utilized my SLURM script to submit with mpirun I've been watching the log and queue for an hour and it seems emitting expected outputs and not failing. Thanks! |
Ah... seems like it's hitting something with mpirun. Same data, topology and almost identical modeling.py (only change is sys.argv portion)
Any suggestions and/or insights are very much appreciated. Thanks. |
Hi,
I would like to perform the computational expensive modeling.py across multiple nodes.
However, it seems like modeling.py with its associated script is made for a single machine.
Do you have any examples or recommendations how to accomplish that?
I assume that i need to use mpi but I'm not sure how to properly modify to maximize the speed, efficiency, and replica exchange across nodes.
Thanks.
best,
hee jong
The text was updated successfully, but these errors were encountered: