PSTSWM T3D Communication Protocol Performance

Performance Studies using PSTSWM


SGI/Cray Research T3D Protocol Performance

(transpose LT experiment II-A / O(P) swap transpose algorithm)

Date/Person: October 28, 1994 / P. Worley
Platform: T3D at Cray Research in Eagan, MN (rain):
   128 150-MHz DEC Alpha EV4 RISC processors
Environment: IS-9.0
   Cray CF90 Version 0.1.2.3
Code Version: 6.1
Compilation Options: /mpp/bin/f90 -dp -Oscalar3
Math Library: none
Communication Library: SHMEM
Parallel Algorithm: swtrans
   (assuming transpose FFT)
Number of Timesteps: 12
Results:

1x16 Processors / Problem T21L2
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.1359e-01   0.06             0.07               0.16
Three Fastest Protocols
  1st   2nd   3rd
  a2    e2    c2
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  1    6    12

1x32 Processors / Problem T42L1
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.6162e-01   0.07             0.06               0.16
Three Fastest Protocols
  1st   2nd   3rd
  e2    a2    d6
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  1    5    12

1x8 Processors / Problem T42L2
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  9.1919e-01   0.03             0.01               0.08
Three Fastest Protocols
  1st   2nd   3rd
  a2    c2    e2
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  6    10   12

1x16 Processors / Problem T85L1
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.4650e+00   0.02             0.01               0.06
Three Fastest Protocols
  1st   2nd   3rd
  c2    a2    d6
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  6    10   12

1x32 Processors / Problem T85L2
Runtime Statistics
  min          (mean-min)/min   (median-min)/min   (max-min)/min
  1.7795e+00   0.02             0.01               0.05
Three Fastest Protocols
  1st   2nd   3rd
  a2    c2    e2
Number of Protocols With Runtimes Within X% of Min
  1%   5%   25%
  7    11   12
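
The summary quantities reported for each configuration above (minimum runtime, the relative excess of the mean, median, and maximum over that minimum, the three fastest protocols, and the count of protocols within X% of the minimum) can be derived from a set of per-protocol runtimes. The following is a minimal sketch of one way to compute them, not the script actually used to produce this page. The function name `summarize` and the sample runtimes are hypothetical, and the sketch assumes the "within X%" count includes the fastest protocol itself, which is consistent with the counts of 1 reported above.

```python
from statistics import mean, median


def summarize(runtimes, percents=(1, 5, 25)):
    """Summarize per-protocol runtimes.

    runtimes: dict mapping a protocol label (e.g. "a2") to its runtime.
    Returns (stats, fastest, within) where
      stats   holds min and the relative excess of mean/median/max over min,
      fastest lists the three protocol labels with the smallest runtimes,
      within  maps each percentage X to the number of protocols whose
              runtime is no more than X% above the minimum.
    """
    values = list(runtimes.values())
    t_min = min(values)

    def rel(x):
        # Relative excess over the minimum: (x - min) / min
        return (x - t_min) / t_min

    stats = {
        "min": t_min,
        "rel_mean": rel(mean(values)),
        "rel_median": rel(median(values)),
        "rel_max": rel(max(values)),
    }

    # Protocol labels ordered from fastest to slowest; keep the first three.
    fastest = sorted(runtimes, key=runtimes.get)[:3]

    # Count protocols within X% of the minimum (the minimum itself counts).
    within = {
        pct: sum(1 for t in values if t <= t_min * (1 + pct / 100))
        for pct in percents
    }
    return stats, fastest, within


if __name__ == "__main__":
    # Hypothetical runtimes for illustration only; not measured data.
    sample = {"a2": 1.00, "c2": 1.004, "e2": 1.03, "d6": 1.20}
    stats, fastest, within = summarize(sample)
    print(stats)
    print(fastest)
    print(within)
```

Expressing the mean, median, and maximum relative to the minimum makes the spread across protocols comparable between problem sizes whose absolute runtimes differ by an order of magnitude.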


Patrick H. Worley / (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:24:12 EDT.