Mpich2 & FDS: unable to connect


Magnus Magnusson

Jun 19, 2008, 12:31:34 PM
to FDS and Smokeview Discussions
Hello!

I have a small problem setting up MPICH2 (and now MPICH1 as well).

We are running five dual-core PCs with Windows XP. Every PC can see the
others and has full write permissions. There is no firewall, the IPs are
static, and FDS is installed in the same directory on all machines. With
FDS4 and MPICH, everything worked fine for years.

Now I have installed MPICH2 and FDS5...

- MPICH1 and FDS4 stopped working after the install:
When I start a calculation, the following error appears (pc12-12 is the
name of one of the PCs):

MPIRunLaunchProcess:Connect to pc12-12 failed, error: Error:
ConnectToMPDReport(pc12-12:8675): easy_connect failed: error 10035,
Host not found

- MPICH2 and FDS5 do not work either and quit with the same error (10065
and 10035).

I checked port 8675; it should be open (there is no firewall, and even
the Windows Firewall is not active).
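
One way to confirm that the port is really reachable from another machine is a short Python sketch like the one below. It is only an illustration: the function name `port_is_open` and the 3-second timeout are assumptions, and it tests a plain TCP connection, not the MPICH handshake itself.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False
```

Running `port_is_open("pc12-12", 8675)` from each of the other PCs would show whether the failure is specific to one machine. A result of False does not say whether the name lookup or the connection itself failed.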

I uninstalled FDS5 and MPICH2 completely and reinstalled them, but
nothing changed.

Any ideas?

Best regards,
Magnus Magnusson

JWilliamson

Jun 19, 2008, 1:06:32 PM
to FDS and Smokeview Discussions
Check each PC for a text file called:

C:\WINDOWS\system32\drivers\etc\hosts

Make sure this file contains the IP information of all of the
connected computers.
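
As a rough sketch of that check, the following Python reads the text of a hosts file and lists the machine names that have no entry. The function names and the IP addresses in the usage note are hypothetical; only the file location comes from the path above.

```python
def parse_hosts(text):
    """Map each host name found in hosts-file text to its IP address."""
    names = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        ip, *aliases = line.split()
        for alias in aliases:
            names[alias.lower()] = ip  # host names are case-insensitive
    return names


def missing_hosts(text, expected):
    """Return the expected host names that have no entry in the hosts text."""
    known = parse_hosts(text)
    return [name for name in expected if name.lower() not in known]
```

For example, with the file contents `127.0.0.1 localhost` and `192.168.0.11 pc11-12` (made-up addresses), `missing_hosts(text, ["pc11-12", "pc12-12"])` returns `["pc12-12"]`, which is the machine that still needs a line in the file.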

If that's not your problem, I would look into the user accounts
(windows login) to make sure that each computer has identical user
account information, all with administrator privileges. If they are
not identical in every way, you will have trouble getting MPICH2 to
work.

Also, there is a diagnostic tool built into MPICH2 that stores your
user name and password information. If you're not careful when you set
that up initially, it will give you trouble even after an uninstall and
reinstall. Read the MPICH2 manual to find out how to clear the password
information, and do it locally on every computer. The next time you try
to use MPI, it will ask you for the user name and password. The proper
user name should be filled in by default by Windows, so just hit return
for that field. Then enter your chosen password.

Also, whenever you make a change it may be necessary to reboot the
machine, even if windows does not prompt you to do so. Just do it
every time, just to be sure.

Good luck! I recently had similar troubles and was only able to
resolve them with some help from IT to get me going in the right
direction.

Justin

Kevin

Jun 19, 2008, 1:06:44 PM
to FDS and Smokeview Discussions
Search the entire Group for MPICH2. Also search the Issue Tracker.
There have been numerous discussions of this.


Magnus Magnusson

Jun 20, 2008, 4:55:33 AM
to FDS and Smokeview Discussions
Thanks for the help, but these settings are correct.
- all hosts are in the hosts file with the correct IPs
- all accounts have the same ID, permissions and password

I will try reinstalling MPICH2 completely. Let's see what happens :-D



Kevin

Jun 20, 2008, 7:51:53 AM
to FDS and Smokeview Discussions
What happens when you type:

mpiexec -n 2 -localonly fds5_mpi device_test.fds

in the directory with the sample input files?

JWilliamson

Jun 20, 2008, 12:03:46 PM
to FDS and Smokeview Discussions
If you are successful with running Kevin's command line, but it still
doesn't work for multiple PCs, you can check your communication/login
credentials with:

mpiexec -validate [-host hostname]

If the credentials don't pass, use:

mpiexec -remove

which removes the user name and password information from the registry,
and then:

mpiexec -register

to enter the correct login credentials.
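
With several PCs it may be easier to script the validation step. The Python sketch below builds one `mpiexec -validate -host <name>` command per machine and optionally runs it. The function names, the 30-second timeout, and the assumption that `mpiexec` is on the PATH and does not prompt for input are all mine, so treat it as a starting point only.

```python
import subprocess


def validate_commands(hosts):
    """Build one 'mpiexec -validate -host <name>' command per host."""
    return [["mpiexec", "-validate", "-host", host] for host in hosts]


def run_validation(hosts, timeout=30):
    """Run each validation command and collect its output text per host."""
    results = {}
    for cmd in validate_commands(hosts):
        host = cmd[-1]
        try:
            done = subprocess.run(cmd, capture_output=True, text=True,
                                  timeout=timeout)
            results[host] = done.stdout.strip()
        except FileNotFoundError:
            results[host] = "mpiexec not found on PATH"
        except subprocess.TimeoutExpired:
            results[host] = "no answer within timeout"
    return results
```

Calling `run_validation(["pc11-12", "pc12-12"])` returns a dictionary of whatever each command printed, so a machine with bad credentials stands out at a glance.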

I also recall having trouble with the way my config.txt file was
written. Try switching the way you specify the directories between
local paths "C:\directory\..." and network addresses
"\\hostname\directory\...". I think the local method was the only way I
could get it to work.

Magnus Magnusson

Jun 24, 2008, 4:42:50 AM
to FDS and Smokeview Discussions
Kevin:
If I put your line in a config file, it doesn't work; if I put the
same information in the mpiexec wrapper, it works:

Mesh 2 of 2 is alive on pc11-12
Mesh 1 of 2 is alive on pc11-12

then the simulation starts correctly.

If I add a second host the error above occurs.

JWilliamson:

Switching directories from C:\... to \\Host\... changes nothing. It
still works on one host but not on two or more.

Kevin

Jun 24, 2008, 8:16:46 AM
to FDS and Smokeview Discussions
The problem is with your network and/or installation of MPICH2. It
appears that the mpi version of FDS is working. Maybe someone can help
you. I have not been able to run parallel jobs on our Windows PCs for
years because of the increasing level of complicated LANs, domains,
security, etc. I run my parallel jobs on a Linux cluster with
virtually all security nonsense disconnected.

Magnus Magnusson

Jun 24, 2008, 8:30:39 AM
to FDS and Smokeview Discussions
Thank you so far. I will try some settings, and if that does not work I
will have to switch to Linux...

pippo

Jun 27, 2008, 10:54:27 AM
to FDS and Smokeview Discussions
I have the same problem with Windows and MPICH2 1.0.6.
My Windows network is OK, but when I try to start MPICH2 on two
machines, I receive this warning:

abort: Unable to connect to 'XPIPPO:8676',
sock error: generic socket failure, error stack:
MPIDU_Sock_post_connect(1228): unable to connect to XPIPPO on port 8676, exhausted all endpoints (errno -1)
MPIDU_Sock_post_connect(1244): gethostbyname failed (errno 11004)
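
Since the last line of the error points at a failed `gethostbyname` call, a quick name-resolution check from Python can narrow things down. This is a minimal sketch; the helper names `resolve` and `unresolved` are hypothetical.

```python
import socket


def resolve(host):
    """Return the IPv4 address for host, or None if the lookup fails."""
    try:
        return socket.gethostbyname(host)
    except OSError:
        # socket.gaierror (lookup failure) is a subclass of OSError.
        return None


def unresolved(hosts):
    """Return the host names that could not be resolved."""
    return [host for host in hosts if resolve(host) is None]
```

If `unresolved(["XPIPPO"])` returns the name when run on the other machine, the problem lies in name resolution rather than in MPICH2 itself.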

In the Configurable Settings window of MPICH2 I can see all hosts,
but I can get the settings only from the local one.
For the second host I receive the error line: "XXHOST: MPICH2 is not
installed or unable to query the host"

All firewalls are down, but in C:\Windows\System32\Drivers\etc\hosts I
can see only
127.0.0.1 localhost
Is this a problem? How can I configure the other hosts? Which IP should
I use when the IP is assigned automatically?

Thanks.