Creating User Accounts on worker nodes

John Rieffel

Apr 24, 2022, 3:50:09 PM
to elasticluster
Hi Y'all,

This may be more of a slurm/MPI installation question than an elasticluster question - but the problem is occurring on a slurm/MPI cluster I built using elasticluster.

When I add new users using `adduser`, new home directories are created for them correctly. But when those users connect to a worker (via `srun -N 4 -t 10 --pty bash`), they have a userid but no username, and no matching entry in /etc/passwd. As a consequence, they cannot run jobs. However, everything works fine for the original default user.

The specific error is as follows:

rieffelj@master001:~$ srun -N 4 -t 10 --pty bash
I have no name!@worker001:~$ mpirun -np 4 ./a.
I have no name!@worker001:~$ mpirun -np 4 ./a.out
--------------------------------------------------------------------------
An ORTE daemon has unexpectedly failed after launch and before
communicating back to mpirun. This could be caused by a number
of factors, including an inability to create a connection back
to mpirun due to a lack of common network interfaces and/or no
route found between them. Please check network connectivity
(including firewalls and network routing requirements).
--------------------------------------------------------------------------

Any thoughts on what I'm doing wrong?  I'm certain I used this same workflow two years ago without any problems.


jr


Riccardo Murri

May 5, 2022, 3:05:14 PM
to John Rieffel, elasticluster
Hello,

Sorry for the late reply. It looks like NIS/YP is not running, or is not exporting the users to the worker nodes for some reason. Can you try running this on the master node (after creating the users with `adduser`) and see if it fixes the problem?

cd /var/yp
sudo make
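
To see whether the rebuild took effect, a check along these lines might help. This is only a sketch: `check_user` is a made-up helper name, and it assumes `getent` is available on the nodes and that the NIS client tools (`ypwhich`) may or may not be installed. It asks NSS whether an account resolves on the current node, which is the lookup that fails when the prompt shows "I have no name!".

```shell
#!/bin/sh
# Hypothetical helper: report whether an account resolves through NSS
# on the node where this runs.
check_user() {
    user="$1"
    # getent consults the sources listed in /etc/nsswitch.conf
    # (local files, NIS, ...), not just /etc/passwd.
    if getent passwd "$user" >/dev/null 2>&1; then
        echo "OK: $user resolves on $(uname -n)"
        return 0
    fi
    echo "MISSING: $user does not resolve on $(uname -n)"
    # If the NIS client tools are present, print the server this node is
    # bound to; a failure here points at the NIS binding itself.
    if command -v ypwhich >/dev/null 2>&1; then
        ypwhich || echo "ypwhich failed: no NIS server bound"
    fi
    return 1
}
```

Running `check_user rieffelj` on the master and again inside an `srun` session on a worker should show whether the account is missing only on the workers.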

Cheers,
Riccardo