Hi all,
I am trying to run parallel_stereo on a cluster using SLURM. I have got errors related to SSH (please see below). Whatever I try, I am keep getting ssh-related errors as soon as I use more than one node. I carefully followed the SLURM example provide in the ASP doc. In this doc, it is mentioned that the nodes must be able to communicate with each other over ssh without a password. This is unfortunate, but I do not think our cluster allows that. I ran a simple job that tried to ssh from one node to another and the connection was blocked. Is there any work around for that in ASP? Is parallel_stereo able to use another communication mechanism, different than ssh?
Many thanks!
Best regards,
- David
parallel: Warning: ssh to ib16be-114 only allows for 0 simultaneous logins.
parallel: Warning: You may raise this by changing
parallel: Warning: /etc/ssh/sshd_config:MaxStartups and MaxSessions on ib16be-114.
parallel: Warning: Using only -1 connections to avoid race conditions.
parallel: Warning: ssh to ib16be-116 only allows for 0 simultaneous logins.
parallel: Warning: You may raise this by changing
parallel: Warning: /etc/ssh/sshd_config:MaxStartups and MaxSessions on ib16be-116.
parallel: Warning: Using only -1 connections to avoid race conditions.
parallel: Error: Cannot run any jobs.
--
You received this message because you are subscribed to the Google Groups "Ames Stereo Pipeline Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ames-stereo-pipeline...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ames-stereo-pipeline-support/fb30bef6-3ead-4934-867d-120b00ff0599n%40googlegroups.com.
Hi Oleg,
We found that the problem was not that the cluster blocks SSH as we initially thought. This is rather that the nodes response to none-default SSH ports that change (e.g., 2200, 2201). I scratched my head the whole day trying to figure how to go around that none-default port issue. May you have any clue about that?
I tried to pass a node list that looks like:
hostname1:port1
hostname2:port2
…
Or
-p port1 hostname1
-p port2 hostname2
Please note that knowing the port # is not a problem. I use $cat /app/config/ssh-port.txt, to get it. It is really to pass the ports to parallel_stereo with hope that GNU Parallel would be able to connect to the nodes. From the GNU Parallel doc, I have not seen any hook allowing us to set the SSH port.
Many thanks,
Best,
- David
To view this discussion on the web visit https://groups.google.com/d/msgid/ames-stereo-pipeline-support/27541b58-36f7-4f39-b1a2-2e5f124c33c3n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ames-stereo-pipeline-support/4df8de3f-2dbb-4d39-b4cd-0ec41d7c7d40n%40googlegroups.com.