From: Jean-Christophe Ducom <jducom_at_nd.edu>
Date: Mon 04 Nov 2002 12:33:19 -0500

When I try to run a sander job to 8 SMP nodes running Linux Redhat 7.2
(kernel 2.4.18) using mpich 1.2.4, I get the following error messages:

*if setenv P4_GLOBMEMSIZE 16000000(or higher), then I get the error
p2_25612: p4_error: interrupt SIGSEGV: 11
p4_22913: p4_error: interrupt SIGSEGV: 11
Broken pipe
Broken pipe
bm_list_26381: (8.040565) wakeup_slave: unable to interrupt slave 0 pid

*if setenv P4_GLOBMEMSIZE 14000000, then:
p2_25887: (6.780981) xx_shmalloc: returning NULL; requested 13914960 bytes
p2_25887: (6.781052) p4_shmalloc returning NULL; request = 13914960 bytes
You can increase the amount of memory by setting the environment variable
P4_GLOBMEMSIZE (in bytes); the current size is 14000000
p2_25887: p4_error: alloc_p4_msg failed: 0
Broken pipe
bm_list_14913: (7.010840) wakeup_slave: unable to interrupt slave 0 pid

Every node (Dual Xeon 1.7Ghz) has 1GB of memory.
# cat /proc/sys/kernel/shmmax ->536870912

The file size.h has been modified as following:
     parameter (MAXREA=3800000)
     parameter (MAXINT=2750000)
     parameter (MAXHOL=1000000)
     parameter (MAXPR=5000000)
     parameter (MAXDUP=8000)
c --- allocate a "stack" space for temporary real variables:
c (size depends on the problem: the maximum value used is reported
c at the end of a calculation)
     parameter (MAX_RSTACK=1600000)
     parameter (MAX_ISTACK=100000)
     parameter (MAX_STACK_PTRS=100)
     parameter (MAX_HEAP_PTRS=100)

Any idea?
Thanks a lot


237 Nieuwland Science Hall
Notre Dame, IN 46556
Received on Mon Nov 04 2002 - 09:33:19 PST
Custom Search