When I try to run a sander job to 8 SMP nodes running Linux Redhat 7.2
(kernel 2.4.18) using mpich 1.2.4, I get the following error messages:
*if setenv P4_GLOBMEMSIZE 16000000(or higher), then I get the error
message:
p2_25612: p4_error: interrupt SIGSEGV: 11
p4_22913: p4_error: interrupt SIGSEGV: 11
Broken pipe
Broken pipe
bm_list_26381: (8.040565) wakeup_slave: unable to interrupt slave 0 pid
26380
*if setenv P4_GLOBMEMSIZE 14000000, then:
p2_25887: (6.780981) xx_shmalloc: returning NULL; requested 13914960 bytes
p2_25887: (6.781052) p4_shmalloc returning NULL; request = 13914960 bytes
You can increase the amount of memory by setting the environment variable
P4_GLOBMEMSIZE (in bytes); the current size is 14000000
p2_25887: p4_error: alloc_p4_msg failed: 0
Broken pipe
bm_list_14913: (7.010840) wakeup_slave: unable to interrupt slave 0 pid
14912
Every node (Dual Xeon 1.7Ghz) has 1GB of memory.
# cat /proc/sys/kernel/shmmax ->536870912
The file size.h has been modified as following:
--------------------------------------
parameter (MAXREA=3800000)
parameter (MAXINT=2750000)
parameter (MAXHOL=1000000)
parameter (MAXPR=5000000)
parameter (MAXDUP=8000)
c
c --- allocate a "stack" space for temporary real variables:
c (size depends on the problem: the maximum value used is reported
c at the end of a calculation)
c
integer MAX_RSTACK,MAX_ISTACK,MAX_STACK_PTRS,MAX_HEAP_PTRS
parameter (MAX_RSTACK=1600000)
parameter (MAX_ISTACK=100000)
parameter (MAX_STACK_PTRS=100)
parameter (MAX_HEAP_PTRS=100)
c
----------------------------------------
Any idea?
Thanks a lot
JC
--------
237 Nieuwland Science Hall
Notre Dame, IN 46556
Received on Mon Nov 04 2002 - 09:33:19 PST