PSTSWM AlphaSC-500 Communication Protocol Performance

Performance Studies using PSTSWM


Compaq AlphaServer SC Protocol Performance

(transpose FFT experiment A1 / O(P) sendrecv transpose algorithm)

Date/Person: November 4, 1999 / P. Worley
Platform: Compaq AlphaServer SC at Compaq (fsehpc):
     16 ES40 4-way SMP nodes (500 MHz Alpha 21264 with 4MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.3.2
Code Version: 6.6.3
Compilation Options: f90 -O4 -assume accuracy_sensitive -math_library accurate -align dcommons -align records -arch host -tune host
Math Library: none
Communication Library: MPI
Parallel Algorithm: srtrans
Number of Timesteps: 12
Results:

16x1 Processors / Problem T10L16
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  8.5233e-02   0.23             0.12               2.16
Three Fastest Protocols
  1st   2nd   3rd
  a3    a6    a2
Number of Protocols With Runtimes Within X% of Min
  1%    5%    25%
  1     8     19

8x1 Processors / Problem T21L8
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.4908e-01   0.06             0.05               0.11
Three Fastest Protocols
  1st   2nd   3rd
  e3    a1    a6
Number of Protocols With Runtimes Within X% of Min
  1%    5%    25%
  3     15    29

16x1 Processors / Problem T21L16
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  2.2204e-01   0.09             0.08               0.22
Three Fastest Protocols
  1st   2nd   3rd
  a3    a1    e2
Number of Protocols With Runtimes Within X% of Min
  1%    5%    25%
  2     9     29

8x1 Processors / Problem T42L16
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.2350e+00   0.05             0.06               0.10
Three Fastest Protocols
  1st   2nd   3rd
  a1    a2    b0
Number of Protocols With Runtimes Within X% of Min
  1%    5%    25%
  2     9     29
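
Note on the summary columns: the sketch below shows one way the statistics reported for each configuration could be derived from a set of per-protocol runtimes. It is an illustration only, not the script used to produce these results. The function name summarize, the SAMPLE runtimes, and the reading of the relative columns as fractions of the minimum (rather than percentages) are all assumptions; the protocol labels are borrowed from the tables, but the runtime values are invented.

```python
from statistics import mean, median


def summarize(runtimes, thresholds=(1, 5, 25)):
    """Summarize per-protocol runtimes in the style of the result tables.

    runtimes: dict mapping a protocol label to its measured runtime.
    (Hypothetical input format; the original data files are not shown.)
    """
    values = list(runtimes.values())
    fastest = min(values)

    def rel(x):
        # Excess over the minimum runtime, as a fraction of the minimum.
        return (x - fastest) / fastest

    stats = {
        "min": fastest,
        "(mean-min)/min": rel(mean(values)),
        "(median-min)/min": rel(median(values)),
        "(max-min)/min": rel(max(values)),
    }

    # Labels of the three protocols with the smallest runtimes.
    top3 = sorted(runtimes, key=runtimes.get)[:3]

    # Count of protocols no more than X% slower than the fastest.
    # The fastest protocol counts itself, so every count is at least 1.
    within = {
        pct: sum(1 for v in values if rel(v) <= pct / 100)
        for pct in thresholds
    }
    return stats, top3, within


# Invented runtimes, for illustration only (not measured data).
SAMPLE = {"a1": 1.00, "a2": 1.004, "a3": 1.03, "b0": 1.20, "e3": 1.50}

if __name__ == "__main__":
    stats, top3, within = summarize(SAMPLE)
    for name, value in stats.items():
        print(f"{name:18s} {value:.4f}")
    print("three fastest:", " ".join(top3))
    print("within X% of min:", within)
```

Under this reading, a (max-min)/min entry of 2.16 would mean that the slowest protocol took a little over three times as long as the fastest one.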


Patrick H. Worley (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:03:26 EDT.