-
Notifications
You must be signed in to change notification settings - Fork 843
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to run openMPI from two machines #12493
Comments
|
Thanks for the reply but I am not sure I understand what you mean I should change. |
make sure there is no firewall between both hosts (passwordless |
It looks like this issue is expecting a response, but hasn't gotten one yet. If there are no responses in the next 2 weeks, we'll assume that the issue has been abandoned and will close it. |
Per the above comment, it has been a month with no reply on this issue. It looks like this issue has been abandoned. I'm going to close this issue. If I'm wrong and this issue is not abandoned, please feel free to re-open it. Thank you! |
Please submit all the information below so that we can understand the working environment that is the context for your question.
Background information
What version of Open MPI are you using? (e.g., v4.1.6, v5.0.1, git branch name and hash, etc.)
It should be the latest version from openFoam installation (a 4 version) but I also built the latest version from your website (version5)
I probably have two version 4 and 5
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
I have installed latest version of openFoam 2312 which comes with a openMPI version 4.
I also built the latest version 5 from your website
If you are building/installing from a git clone, please copy-n-paste the output from
git submodule status
.Please describe the system on which you are running
Details of the problem
I have two machines A and B with identical HW and SW. They seem to have no problems in ssh and sharing a folder (on A).
I can regularly run an example (as the hello_c.c from example folder) or a openfoam simulation in parallel using the 64 cores of a single machine with a command : ...$ mpirun -np 64 ./hello . Either on A and B machine.
If I try to run both machines as for example ...$ mpirun --hostfile /etc/hosts -np 168 ./hello the terminal hangs and no output is shown (error messages neither).
I am attaching some of the configurations of my system and the strace final part of the command ...$ strace mpirun --hostfile /etc/hosts -np 128 ./hello
Note: If you include verbatim output (or a code block), please use a GitHub Markdown code block like below:
documents.zip
The text was updated successfully, but these errors were encountered: