PSTSWM AlphaSC-500 Point-to-Point Communication Performance

Performance Studies using PSTSWM


Compaq AlphaServer SC SWAP Performance

(ordered swap of 2MB message using MPI between two nodes)

(performance measured per processor when all processors in node communicating)

Date/Person: January 26, 2000 / P. Worley
Platform: Compaq AlphaServer SC at Oak Ridge National Laboratory (colt.ccs.ornl.gov):
     16 ES40 4-way SMP nodes (500 MHz Alpha 21264 with 4MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.36
Communication Library: MPI
SWAP size: 262144 REAL*8 floating point values each direction
Message size: Largest - 262144 REAL*8 floating point values
     Smallest - 256 REAL*8 floating point values
Processors: 0 and 4
     1 and 5
     2 and 6
     3 and 7
Latency Definition: (T1024-T512)/512
Model Error Range: [1,1024]
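
The latency definition above can be read as a two-point estimate. As a minimal sketch, assume that T1024 and T512 denote the times to move the same total volume as 1024 and 512 messages respectively. This reading is not spelled out here, though it is consistent with the smallest message size, since 262144 / 256 = 1024. Under a linear cost model, the difference T1024 - T512 then isolates 512 extra per-message latencies. The function name and the timing values below are hypothetical.

```python
def estimate_latency(t1024: float, t512: float) -> float:
    """Per-message latency in seconds from the definition (T1024 - T512) / 512."""
    return (t1024 - t512) / 512


# Hypothetical timings in seconds, chosen only for illustration.
t1024 = 0.0700  # same data volume sent as 1024 messages (assumed meaning)
t512 = 0.0630   # same data volume sent as 512 messages (assumed meaning)

latency_usec = estimate_latency(t1024, t512) * 1e6
print(f"estimated latency: {latency_usec:.2f} usec/msg")
```

The transfer-time term cancels in the subtraction because both runs move the same number of bytes, which is why only the message-count difference (512) appears in the denominator.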
Results:

ordered simple swap
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.62                      13.52               0.8%
10 iter.             40.34                      14.09               1.7%

ordered swap using nonblocking send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.79                      13.47               1.0%
10 iter.             39.86                      12.94               0.5%
1 iter. w/overlap    39.43                      14.65               0.9%
10 iter. w/overlap   39.89                      15.08               0.6%

ordered swap using nonblocking receive
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.59                      14.72               0.7%
10 iter.             39.82                      14.60               0.8%
1 iter. w/overlap    39.48                      17.21               3.6%
10 iter. w/overlap   39.87                      17.90               4.6%

ordered swap using nonblocking send and receive
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.61                      15.04               1.2%
10 iter.             40.19                      14.82               1.2%
1 iter. w/overlap    39.60                      17.72               3.3%
10 iter. w/overlap   40.06                      19.73               5.3%

ordered swap using ready send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.50                      21.71               1.9%
10 iter.             40.29                      21.51               3.1%
1 iter. w/overlap    39.74                      17.39               3.2%
10 iter. w/overlap   40.14                      17.80               3.9%

ordered swap using nonblocking ready send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.69                      22.97               1.3%
10 iter.             40.31                      21.37               3.6%
1 iter. w/overlap    39.47                      17.70               3.6%
10 iter. w/overlap   39.81                      19.37               5.3%

synchronous
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.52                      24.16               6.0%
10 iter.             40.10                      24.97               5.1%

ordered swap using nonblocking sync. send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.62                      13.51               0.9%
10 iter.             39.99                      12.61               1.0%
1 iter. w/overlap    39.59                      14.59               0.9%
10 iter. w/overlap   39.97                      14.95               0.8%

ordered swap using nonblocking receive with sync. send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.49                      14.68               1.5%
10 iter.             40.10                      14.48               1.2%
1 iter. w/overlap    39.52                      17.08               3.3%
10 iter. w/overlap   40.39                      18.08               3.9%

ordered swap using nonblocking sync. send and receive
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.44                      14.65               1.1%
10 iter.             40.60                      14.37               2.5%
1 iter. w/overlap    39.43                      17.55               3.5%
10 iter. w/overlap   39.90                      19.01               4.5%

ordered simple swap using sync. send
Data Statistics
                     unidirectional bandwidth   estimated latency   model error
                     (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.              39.54                      13.93               0.8%
10 iter.             39.76                      13.62               1.3%


Protocol Sensitivity Summary for Unidirectional Swap of 2097152 Bytes (1 iteration/no overlap)
Runtime Statistics
  Msg Size   min Sec   min Sec/Msg   max MBytes/Sec   (mean-min)/min   (median-min)/min   (max-min)/min
  2048   1.3301e-01   1.2990e-04   31.53   0.05   0.02   0.16 
  4096   1.1918e-01   2.3276e-04   35.19   0.02   0.01   0.10 
  8192   1.1264e-01   4.3999e-04   37.24   0.01   0.00   0.05 
  16384   1.0883e-01   8.5023e-04   38.54   0.01   0.01   0.01 
  32768   1.0655e-01   1.6649e-03   39.36   0.01   0.00   0.01 
  65536   1.0610e-01   3.3157e-03   39.53   0.00   0.00   0.01 
  131072   1.0567e-01   6.6043e-03   39.69   0.00   0.00   0.01 
  262144   1.0614e-01   1.3268e-02   39.52   0.00   0.00   0.00 
  524288   1.0540e-01   2.6350e-02   39.79   0.01   0.01   0.01 
  1048576   1.0589e-01   5.2943e-02   39.61   0.01   0.01   0.01 
  2097152   1.0642e-01   1.0642e-01   39.41   0.00   0.00   0.00 
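
The derived columns in the table above appear to follow from min Sec by simple arithmetic. The sketch below reproduces them under two assumptions that are inferred from the tabulated values rather than stated in the text: the message count is the swap size divided by the message size, and the bandwidth counts the 2097152 bytes moved in each direction (hence the factor of 2). The function names are illustrative.

```python
SWAP_BYTES = 2097152  # 262144 REAL*8 values, each direction


def sec_per_msg(min_sec: float, msg_bytes: int) -> float:
    """Time per message, assuming the swap is split into SWAP_BYTES / msg_bytes messages."""
    return min_sec / (SWAP_BYTES // msg_bytes)


def mbytes_per_sec(min_sec: float) -> float:
    """Bandwidth in units of 10**6 bytes/sec, assuming both directions are counted."""
    return 2 * SWAP_BYTES / min_sec / 1e6


# (Msg Size, min Sec) pairs copied from the first and last rows of the table.
for msg_bytes, min_sec in [(2048, 1.3301e-01), (2097152, 1.0642e-01)]:
    print(msg_bytes, f"{sec_per_msg(min_sec, msg_bytes):.4e}", f"{mbytes_per_sec(min_sec):.2f}")
```

Differences in the last printed digit relative to the table are expected, since min Sec is itself a rounded value.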
Five Fastest Protocols
  Msg Size   1st   2nd   3rd   4th   5th
  2048   7   1   0   10   9 
  4096   7   10   1   0   3 
  8192   0   10   7   2   8 
  16384   3   2   8   9   7 
  32768   8   7   0   1   3 
  65536   3   8   5   1   7 
  131072   5   7   0   1   2 
  262144   6   4   3   5   7 
  524288   1   10   3   6   2 
  1048576   3   5   9   8   1 
  2097152   9   1   8   3   10 
Number of Protocols With Runtimes Within X% of Min
  Msg Size   1%   5%   25%
  2048    4   8   11 
  4096    8   9   11 
  8192    8   10   11 
  16384    9   11   11 
  32768    8   11   11 
  65536    11   11   11 
  131072    11   11   11 
  262144    11   11   11 
  524288    7   11   11 
  1048576    11   11   11 
  2097152    11   11   11 
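
To make the spread statistics and the two ranking tables concrete, the sketch below computes them from a list of per-protocol runtimes. It assumes that the statistics are taken across the protocols at a fixed message size, and that "within X% of min" means a runtime no greater than min * (1 + X/100). Both readings are assumptions, and the runtimes and the function name are hypothetical.

```python
from statistics import mean, median


def sensitivity_summary(runtimes: list[float]) -> dict:
    """Summarize per-protocol runtimes (seconds); list index is the protocol number."""
    fastest = min(runtimes)
    # Protocol numbers ordered from fastest to slowest.
    ranked = sorted(range(len(runtimes)), key=lambda p: runtimes[p])
    return {
        "mean_rel": (mean(runtimes) - fastest) / fastest,
        "median_rel": (median(runtimes) - fastest) / fastest,
        "max_rel": (max(runtimes) - fastest) / fastest,
        "five_fastest": ranked[:5],
        "within": {
            pct: sum(t <= fastest * (1 + pct / 100) for t in runtimes)
            for pct in (1, 5, 25)
        },
    }


# Hypothetical runtimes for protocols 0..10; not measured data.
runtimes = [0.110, 0.102, 0.124, 0.104, 0.108, 0.100, 0.1012, 0.103, 0.115, 0.130, 0.1005]
summary = sensitivity_summary(runtimes)
print(summary["five_fastest"], summary["within"])
```

Normalizing every statistic by the minimum runtime keeps the columns comparable across message sizes, whose absolute runtimes differ.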


Protocol Sensitivity Summary for Unidirectional Swap of 2097152 Bytes (10 iterations/no overlap)
Runtime Statistics
  Msg Size   min Sec   min Sec/Msg   max MBytes/Sec   (mean-min)/min   (median-min)/min   (max-min)/min
  2048   1.3187e-01   1.2878e-04   31.81   0.05   0.02   0.16 
  4096   1.1878e-01   2.3199e-04   35.31   0.02   0.01   0.10 
  8192   1.1206e-01   4.3772e-04   37.43   0.01   0.00   0.05 
  16384   1.0857e-01   8.4824e-04   38.63   0.00   0.00   0.01 
  32768   1.0556e-01   1.6493e-03   39.74   0.01   0.01   0.02 
  65536   1.0551e-01   3.2971e-03   39.75   0.01   0.01   0.01 
  131072   1.0528e-01   6.5801e-03   39.84   0.00   0.00   0.00 
  262144   1.0483e-01   1.3104e-02   40.01   0.01   0.01   0.01 
  524288   1.0405e-01   2.6014e-02   40.31   0.01   0.01   0.02 
  1048576   1.0331e-01   5.1655e-02   40.60   0.02   0.02   0.02 
  2097152   1.0437e-01   1.0437e-01   40.19   0.01   0.01   0.02 
Five Fastest Protocols
  Msg Size   1st   2nd   3rd   4th   5th
  2048   7   1   10   0   9 
  4096   10   1   0   7   9 
  8192   10   0   7   1   9 
  16384   8   2   3   9   7 
  32768   0   10   2   1   9 
  65536   10   7   0   1   3 
  131072   3   2   1   10   8 
  262144   0   5   9   7   1 
  524288   5   6   2   1   4 
  1048576   9   0   4   8   7 
  2097152   3   5   0   1   7 
Number of Protocols With Runtimes Within X% of Min
  Msg Size   1%   5%   25%
  2048    3   8   11 
  4096    8   9   11 
  8192    8   9   11 
  16384    9   11   11 
  32768    7   11   11 
  65536    11   11   11 
  131072    11   11   11 
  262144    11   11   11 
  524288    2   11   11 
  1048576    3   11   11 
  2097152    4   11   11 


Protocol Sensitivity Summary for Unidirectional Swap of 2097152 Bytes (1 iteration with overlap)
Runtime Statistics
  Msg Size   min Sec   min Sec/Msg   max MBytes/Sec   (mean-min)/min   (median-min)/min   (max-min)/min
  2048   1.3545e-01   1.3228e-04   30.96   0.02   0.01   0.12 
  4096   1.1954e-01   2.3348e-04   35.09   0.01   0.00   0.05 
  8192   1.1260e-01   4.3985e-04   37.25   0.01   0.00   0.02 
  16384   1.0895e-01   8.5113e-04   38.50   0.00   0.00   0.01 
  32768   1.0602e-01   1.6566e-03   39.56   0.01   0.01   0.02 
  65536   1.0581e-01   3.3066e-03   39.64   0.00   0.00   0.01 
  131072   1.0577e-01   6.6108e-03   39.65   0.00   0.01   0.01 
  262144   1.0554e-01   1.3193e-02   39.74   0.01   0.01   0.01 
  524288   1.0619e-01   2.6548e-02   39.50   0.00   0.00   0.00 
  1048576   1.0631e-01   5.3153e-02   39.46   0.00   0.00   0.00 
  2097152   1.0646e-01   1.0646e-01   39.40   0.01   0.00   0.05 
Five Fastest Protocols
  Msg Size   1st   2nd   3rd   4th   5th
  2048   1   7   10   0   8 
  4096   2   4   8   5   9 
  8192   8   4   2   5   9 
  16384   2   8   4   5   3 
  32768   0   10   2   4   9 
  65536   0   7   6   4   8 
  131072   10   6   8   4   3 
  262144   4   3   8   2   5 
  524288   8   4   9   0   5 
  1048576   2   3   4   10   7 
  2097152   2   1   9   3   5 
Number of Protocols With Runtimes Within X% of Min
  Msg Size   1%   5%   25%
  2048    3   10   11 
  4096    10   11   11 
  8192    10   11   11 
  16384    10   11   11 
  32768    3   11   11 
  65536    11   11   11 
  131072    11   11   11 
  262144    10   11   11 
  524288    11   11   11 
  1048576    11   11   11 
  2097152    9   11   11 


Protocol Sensitivity Summary for Unidirectional Swap of 2097152 Bytes (10 iterations with overlap)
Runtime Statistics
  Msg Size   min Sec   min Sec/Msg   max MBytes/Sec   (mean-min)/min   (median-min)/min   (max-min)/min
  2048   1.3513e-01   1.3196e-04   31.04   0.02   0.01   0.12 
  4096   1.1775e-01   2.2997e-04   35.62   0.02   0.01   0.06 
  8192   1.1139e-01   4.3512e-04   37.65   0.01   0.01   0.03 
  16384   1.0867e-01   8.4896e-04   38.60   0.01   0.00   0.01 
  32768   1.0563e-01   1.6505e-03   39.71   0.01   0.01   0.02 
  65536   1.0556e-01   3.2987e-03   39.73   0.00   0.00   0.01 
  131072   1.0527e-01   6.5795e-03   39.84   0.00   0.00   0.01 
  262144   1.0489e-01   1.3111e-02   39.99   0.01   0.01   0.01 
  524288   1.0507e-01   2.6266e-02   39.92   0.00   0.00   0.01 
  1048576   1.0385e-01   5.1924e-02   40.39   0.01   0.01   0.02 
  2097152   1.0443e-01   1.0443e-01   40.16   0.01   0.01   0.02 
Five Fastest Protocols
  Msg Size   1st   2nd   3rd   4th   5th
  2048   7   1   10   8   0 
  4096   8   2   4   3   5 
  8192   5   9   3   2   8 
  16384   4   5   9   3   8 
  32768   10   0   2   3   9 
  65536   10   0   3   5   1 
  131072   9   10   5   1   2 
  262144   4   6   9   8   2 
  524288   8   2   0   6   9 
  1048576   8   4   3   6   0 
  2097152   0   6   9   8   5 
Number of Protocols With Runtimes Within X% of Min
  Msg Size   1%   5%   25%
  2048    7   10   11 
  4096    5   10   11 
  8192    6   11   11 
  16384    11   11   11 
  32768    3   11   11 
  65536    11   11   11 
  131072    11   11   11 
  262144    11   11   11 
  524288    11   11   11 
  1048576    4   11   11 
  2097152    4   11   11 

DISCUSSION


Patrick H. Worley (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:03:02 EDT.