Installation and get on working with pdsh.

910 views
Skip to first unread message

MiNdFrEak

unread,
Sep 18, 2012, 6:52:49 PM9/18/12
to pdsh-...@googlegroups.com
Hello,

Recently I tried to install pdsh for using pdcp. I have 4 remote machines. I need to use one of them to issue pdcp
command so that a file can be sent parallaly to  other 3 remote machine. The machines I am using are 
a part of a grid,say named as comgrid. suppose the names of the machines are:

machine#1: oriental
machine#2: asper
machine#3: looper
machine#4: normandy

so those are identified as suffixed "comgrid.ac" with there name.

for an example machine#1 address is: oriental.comgrid.ca.
username: mindfreak
oriental is used as the root site(here 'root' means the pc from where I am going to send a file to remote host parallaly.i.e issue:pdcp command).

I downloaded, pdsh-2.26.tar.bz2 in one of them and scp that to all others, then untar it using tar -jvxf pdsh-2.26.tar.bz2, in their: /home/mindfreak/ directory.

The steps I followed to install pdsh:
--------------------------------------------------

1.I configured all the 4 machines for passwordless access from 'Oriental'-the root(by copying public keys of root to them and vice-versa).

2. In the Oriental, I issued following command(inside the pdsh-2.26 directory) ./configure --with-ssh --prefix=/home/mindfreak/local, then 'make',then 'make install', I repeated the same process in other hosts [ where I have also the same file structure under home/mindfreak,as I said earlier].

The problems that occured:
----------------------------------------
1.  When I issued pdcp command (i.e:"../local/bin/pdcp  -w mind...@asper.comgrid.ac,looper.comgrid.ac test.txt /home/mindfreak/Test_Data") from Oriental,to send same files to the remote machines(i.e:asper and looper),it returned,"rcmd socket: permission denied" error.

2. Then I tried the same command using -R ssh like this:"../local/bin/pdcp -R ssh -w mind...@asper.comgrid.ac,looper.comgrid.ac test.txt /home/mindfreak/Test_Data" from the Test_Data folder of root. It returned the following error:"looper: bash: /global/home/mindfreak/Test_Data/../local/bin/pdcp: No such file or directory pdcp@seawolf2: looper: ssh exited with exit code 127 asper: bash: /global/home/mindfreak/Test_Data/../local/bin/pdcp: No such file or directory pdcp@seawolf2: asper: ssh exited with exit code 127".

NB: I dont have permission to any of the machine's /local/usr/bin and /etc so I had to install it in my own path: /home/mindfreak/local.

I will be very grateful if anyone helps me finding out and solve the possible problem.

Thanks,

MiNdFrEaK


Mark Grondona

unread,
Sep 18, 2012, 7:38:22 PM9/18/12
to pdsh-...@googlegroups.com, me....@gmail.com, pdsh-...@lists.sourceforge.net
Hi, I'm not sure if I've fully transitioned the pdsh-users list to 
googlegroups yet, so perhaps no one else will see this message.

I'm cc'ing pdsh-...@lists.sourceforge.net on the reply (inline below)

On Tue, Sep 18, 2012 at 10:52 PM, MiNdFrEak <me....@gmail.com> wrote:
Hello,

Recently I tried to install pdsh for using pdcp. I have 4 remote machines. I need to use one of them to issue pdcp
command so that a file can be sent parallaly to  other 3 remote machine. The machines I am using are 
a part of a grid,say named as comgrid. suppose the names of the machines are:

machine#1: oriental
machine#2: asper
machine#3: looper
machine#4: normandy

so those are identified as suffixed "comgrid.ac" with there name.

for an example machine#1 address is: oriental.comgrid.ca.
username: mindfreak
oriental is used as the root site(here 'root' means the pc from where I am going to send a file to remote host parallaly.i.e issue:pdcp command).

I downloaded, pdsh-2.26.tar.bz2 in one of them and scp that to all others, then untar it using tar -jvxf pdsh-2.26.tar.bz2, in their: /home/mindfreak/ directory.

The steps I followed to install pdsh:
--------------------------------------------------

1.I configured all the 4 machines for passwordless access from 'Oriental'-the root(by copying public keys of root to them and vice-versa).

2. In the Oriental, I issued following command(inside the pdsh-2.26 directory) ./configure --with-ssh --prefix=/home/mindfreak/local, then 'make',then 'make install', I repeated the same process in other hosts [ where I have also the same file structure under home/mindfreak,as I said earlier].
The problems that occured:
----------------------------------------
1.  When I issued pdcp command (i.e:"../local/bin/pdcp  -w mind...@asper.comgrid.ac,looper.comgrid.ac test.txt /home/mindfreak/Test_Data") from Oriental,to send same files to the remote machines(i.e:asper and looper),it returned,"rcmd socket: permission denied" error.

Add --without-rsh to the pdsh ./configure options. You don't want/need rcmd support and
this is the default remote connect method, so best not to build and install it. You can also
remove the rcmd.so by hand after the fact.

 
2. Then I tried the same command using -R ssh like this:"../local/bin/pdcp -R ssh -w mind...@asper.comgrid.ac,looper.comgrid.ac test.txt /home/mindfreak/Test_Data" from the Test_Data folder of root. It returned the following error:"looper: bash: /global/home/mindfreak/Test_Data/../local/bin/pdcp: No such file or directory pdcp@seawolf2: looper: ssh exited with exit code 127 asper: bash: /global/home/mindfreak/Test_Data/../local/bin/pdcp: No such file or directory pdcp@seawolf2: asper: ssh exited with exit code 127".

pdsh (running as pdcp) needs to be able to find the pdcp binary on the remote system
since the program acts as the remote copy server as well as the client. You can override
the default location with the PDSH_REMOTE_PDCP_PATH environment variable, or use
 -e PATH 
on the command line.

Try that and let us know what snag you hit next ;-)

mark
Reply all
Reply to author
Forward
0 new messages