Re: P4_GLOBMEMSIZE problem

From: Jean-Christophe Ducom <jducom_at_nd.edu>
Date: Tue 05 Nov 2002 12:53:11 -0500

Ross,

    Thanks for your email. Unfortunately it doesn't work :(
a) I get the following error message:
p0_17603: p4_error: exceeding max num of P4_MAX_SYSV_SHMIDS: 256
b)if I set P4_GLOBMEMSIZE to 256MB, I get the same error message as
before (w/ 16MB)

Do you suggest to recompile MPICH w/ higher shared memory segments?
Thanks.

            JC


 Walker wrote:

>Try setting the value much much higher.
>
>On our cluster I have this set in /etc/bashrc to:
>
>Export P4_GLOBMEMSIZE=536870912
>
>(=512MB).
>
>This should work.
>
>All the best
>Ross
>
>/\
>\/
>|\oss Walker
>
>| Imperial College of Science, Technology & Medicine |
>| Department of Chemistry | Theoretical Division |
>| Tel:- +44 20 759(45851) |
>| EMail:- http://www.rosswalker.co.uk/ |
>| PGP Key available on request |
>
>
>-----Original Message-----
>From: Jean-Christophe Ducom [mailto:jducom_at_nd.edu]
>Sent: 04 November 2002 17:33
>To: amber_at_heimdal.compchem.ucsf.edu
>Subject: P4_GLOBMEMSIZE problem
>
>
>When I try to run a sander job to 8 SMP nodes running Linux Redhat 7.2
>(kernel 2.4.18) using mpich 1.2.4, I get the following error messages:
>
>*if setenv P4_GLOBMEMSIZE 16000000(or higher), then I get the error
>message:
>p2_25612: p4_error: interrupt SIGSEGV: 11
>p4_22913: p4_error: interrupt SIGSEGV: 11
>Broken pipe
>Broken pipe
>bm_list_26381: (8.040565) wakeup_slave: unable to interrupt slave 0 pid
>26380
>
>*if setenv P4_GLOBMEMSIZE 14000000, then:
>p2_25887: (6.780981) xx_shmalloc: returning NULL; requested 13914960
>bytes
>p2_25887: (6.781052) p4_shmalloc returning NULL; request = 13914960
>bytes
>You can increase the amount of memory by setting the environment
>variable
>P4_GLOBMEMSIZE (in bytes); the current size is 14000000
>p2_25887: p4_error: alloc_p4_msg failed: 0
>Broken pipe
>bm_list_14913: (7.010840) wakeup_slave: unable to interrupt slave 0 pid
>14912
>
>Every node (Dual Xeon 1.7Ghz) has 1GB of memory.
># cat /proc/sys/kernel/shmmax ->536870912
>
>The file size.h has been modified as following:
>--------------------------------------
> parameter (MAXREA=3800000)
> parameter (MAXINT=2750000)
> parameter (MAXHOL=1000000)
> parameter (MAXPR=5000000)
> parameter (MAXDUP=8000)
>c
>c --- allocate a "stack" space for temporary real variables:
>c (size depends on the problem: the maximum value used is reported
>c at the end of a calculation)
>c
> integer MAX_RSTACK,MAX_ISTACK,MAX_STACK_PTRS,MAX_HEAP_PTRS
> parameter (MAX_RSTACK=1600000)
> parameter (MAX_ISTACK=100000)
> parameter (MAX_STACK_PTRS=100)
> parameter (MAX_HEAP_PTRS=100)
>c
>----------------------------------------
>
>Any idea?
>Thanks a lot
>
> JC
>
>--------
>237 Nieuwland Science Hall
>Notre Dame, IN 46556
>
>
>
>
>
Received on Tue Nov 05 2002 - 09:53:11 PST
Custom Search