>#proc #nodes nonsetup %cpu commun.
> [s] on master [s]
>
>1 1 137 100 0.0
>2 2 98 95 1.8
>2 1 92 98 1.7
>4 4 59 85 3.3
>4 2 56 91 2.4
>6 6 48 74 3.7
>6 3 44 79 3.5
>8 8 53 56 5.7
>8 4 37 74 3.2
>10 10 38 69 4.8
>10 5 36 67 4.3
>12 12 35 66 4.7
>12 6 34 62 3.9
>14 14 33 57 4.5
>14 7 31 60 4.0
>16 8 29 56 3.9
>As you can see the scaling is not perfect but it is not so bad,
>especially up to 8 processors on 4 nodes.
If I read this properly, you're getting a scaling of 48% for the second node (from 137
to 92 seconds), are you sure that is "not too bad"?
>A few more details : PROWAT is Amber 4.1 benchmark: plastocyanin in water
>(11585 atoms) with cut=12.0. Sander was compiled with g77 2.95.2 and MPICH
>1.2.0, with the following options :
>
>-fomit-frame-pointer -malign-double -fcaller-saves -fno-f2c
>-march=pentiumpro -funix-intrinsics-hide -O6
I did some experimenting with the various compiler flags, the current
fastest I've found (using egcs-1.1.2) is
-O3 -m486 -malign-double -ffast-math -fno-strength-reduce
-fomit-frame-pointer slowed things down, while -fno-strength-reduce speeded
things up. However, egcs-1.1.2 differs from gcc 2.95.2. I was planning to
compare the two compilers some time soon.
Dave
Received on Wed Jun 14 2000 - 10:18:16 PDT