Re: parallel jobs die with no error message from sander

From: Joffre Heredia <joffre_at_yogi.uab.es>
Date: Tue 30 Jul 2002 10:46:37 -0700

 Our nodes are isolated from the rest of the network. They are in a
different network. I guess it could be a problem with mpich, what do you
think?

-------------------------------------------------------------
Joffre Heredia Rodrigo Tel: (34)-93-5813812
Laboratory of Computational Medicine Fax: (34)-93-5812344
Biostatistic Dept.
UAB School of Medicine. Bellaterra Joffre.Heredia_at_uab.es
08193-Barcelona (SPAIN)
-------------------------------------------------------------

On Mon, 29 Jul 2002, jim caldwell wrote:

>
> That looks like a communication problem to me. Are your machines
> on a busy network? Can you isolate the compute nodes behind a
> router/switch?
>
> jim
>
Received on Tue Jul 30 2002 - 10:46:37 PDT
Custom Search