COMS 4995 Issues with LAPI and MPI

8 views
Skip to first unread message

Mashooq Muhaimen

unread,
Dec 11, 2010, 2:00:43 PM12/11/10
to Vijay Saraswat, mak...@columbia.edu, coms49...@googlegroups.com, Sreedhar Kodali
Prof. Saraswat/ Prof. Kim,

I can use the sockets implementation fine, but run into problems when I try to use lapi or mpi. The reason I am trying lapi/mpi is because the X10 Performance tuning page says the following:

"The pgas_sockets implementation of X10RT will perform poorly (high latency, low bandwidth).

If possible (if they are available for your platform), use pgas_lapi or pgas_bgp. As a second choice, use the MPI-based implementation of X10RT."

I tried to follow the instruction here : http://x10.codehaus.org/X10+Application+Development

1) This is what I get for lapi: 

>> poe ./Clr.lapi -hostfile hosts.67200 -procs 1 -msg_api lapi
ERROR: 0031-212  pmd: node porthos.watson.ibm.com: user mm3858 denied from access from host athos.watson.ibm.com
ERROR: 0031-024  porthos.watson.ibm.com: no response; rc = -1

Full sequence of shell commands I used for lapi is attached.

2) Fo MPI, if I follow the instructions on the website, in the very last step when I type in
srun -n2 ./<my_program>.mpi,  Slurm spawns multiple copies of my program( with each copy having numPlaces = 1), which is obviously not what I want.

Any ideas or insights? Did anyone else get these to work? I am looking into my environment variables now.

Thanks,
Mashooq
SlurmLapi.txt
Reply all
Reply to author
Forward
0 new messages