PSTSWM AlphaSC-500 Communication Protocol Performance

Performance Studies using PSTSWM


Compaq AlphaServer SC Protocol Performance

(transpose LT experiment II-A1 / O(P) swap transpose algorithm)

Date/Person: November 4, 1999 / P. Worley
Platform: Compaq AlphaServer SC at Compaq (fsehpc):
     16 ES40 4-way SMP nodes (500 MHz Alpha 21264 with 4MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.3.2
Code Version: 6.6.3
Compilation Options: f90 -O4 -assume accuracy_sensitive -math_library accurate -align dcommons -align records -arch host -tune host
Math Library: none
Communication Library: MPI
Parallel Algorithm: swtrans
(assuming transpose FFT)
Number of Timesteps: 12
Results:

1x16 Processors / Problem T21L2
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  3.2613e-02   0.39             0.29               0.95
Three Fastest Protocols
  1st   2nd   3rd
  e3    c2    e1
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  1    1    11

1x8 Processors / Problem T42L2
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.0008e-01   0.13             0.13               0.21
Three Fastest Protocols
  1st   2nd   3rd
  e3    e2    e1
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  2    3    29

1x16 Processors / Problem T85L1
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.5014e-01   0.10             0.09               0.17
Three Fastest Protocols
  1st   2nd   3rd
  e2    a3    e3
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  1    6    29

1x8 Processors / Problem T85L4
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.0246e+00   0.12             0.13               0.20
Three Fastest Protocols
  1st   2nd   3rd
  e2    e3    e1
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  1    1    29
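
The statistics reported for each configuration above can be reproduced from a table of per-protocol runtimes. The following Python sketch shows one plausible way to compute them. It is not part of PSTSWM: the function name runtime_summary and the sample runtimes are hypothetical and chosen only for illustration, and it assumes that the "within X% of min" counts include the fastest protocol itself (consistent with the counts above never being less than 1).

```python
from statistics import mean, median


def runtime_summary(runtimes, percents=(1, 5, 25)):
    """Summarize per-protocol runtimes relative to the fastest protocol.

    runtimes: dict mapping a protocol label (e.g. "e3") to its runtime.
    Returns the minimum runtime, the mean/median/max excess over the
    minimum as a fraction of the minimum, the three fastest labels, and
    the number of protocols within each percentage of the minimum.
    """
    values = list(runtimes.values())
    fastest = min(values)

    def excess(x):
        # Relative slowdown with respect to the fastest protocol.
        return (x - fastest) / fastest

    # Labels ordered from fastest to slowest.
    ranked = sorted(runtimes, key=runtimes.get)

    # Assumption: the fastest protocol counts as being within X% of itself.
    within = {
        pct: sum(1 for v in values if v <= fastest * (1 + pct / 100))
        for pct in percents
    }

    return {
        "min": fastest,
        "mean_excess": excess(mean(values)),
        "median_excess": excess(median(values)),
        "max_excess": excess(max(values)),
        "three_fastest": ranked[:3],
        "within": within,
    }


# Hypothetical runtimes, NOT measured data from this study.
sample = {"a1": 1.30, "a2": 1.10, "e1": 1.04, "e2": 1.00, "e3": 1.005}
summary = runtime_summary(sample)

if __name__ == "__main__":
    for key, value in summary.items():
        print(key, value)
```

Normalizing by the minimum makes the spread comparable across problem sizes whose absolute runtimes differ by more than an order of magnitude, as they do between T21L2 and T85L4 here.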


Patrick H. Worley / (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:03:29 EDT.