PSTSWM AlphaSC-500 Communication Protocol Performance

Performance Studies using

PSTSWM


Compaq AlphaServer SC Protocol Performance

(transpose FFT experiment A1 / O(log P) transpose algorithm)

Date/Person: November 4, 1999 / P. Worley
Platform: Compaq AlphaServer SC at Compaq (fsehpc):
     16 ES40 4-way SMP nodes (500 MHz Alpha 21264 with 4MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.3.2
Code Version: 6.6.3
Compilation Options: f90 -O4 -assume accuracy_sensitive -math_library accurate -aling dcommons -align records -arch host -tune host
Math Library: none
Communication Library: MPI
Parallel Algorithm: logtrans
Number of Timesteps: 12
Results:

16x1 Processors / Problem T10L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  8.3529e-02   0.11   0.06   0.87 
Three Fastest
Protocols
1st2nd3rd
  c2   a2   d2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  1   10   22 

8x1 Processors / Problem T21L8
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.7375e-01   0.02   0.02   0.09 
Three Fastest
Protocols
1st2nd3rd
  c3   a2   a1 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  4   23   24 

16x1 Processors / Problem T21L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.6547e-01   0.07   0.03   0.60 
Three Fastest
Protocols
1st2nd3rd
  c3   d2   c2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  2   21   22 

8x1 Processors / Problem T42L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.5774e+00   0.03   0.03   0.07 
Three Fastest
Protocols
1st2nd3rd
  d2   b0   b1 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  1   22   24 


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:03:21 EDT.