PSTSWM T3D Communication Protocol Performance

Performance Studies using

PSTSWM


SGI/Cray Research T3D Protocol Performance

(transpose FFT experiment A / O(P) swap transpose algorithm)

Date/Person: October 28, 1994 / P. Worley
Platform: T3D at Cray Research in Eagen, MN (rain):
   128 150-MHz DEC Alpha EV4 RISC processors
Environment: IS-9.0
Cray CF90 Version 0.1.2.3
Code Version: 6.1
Compilation Options: /mpp/bin/f90 -dp -Oscalar3
Math Library: none
Communication Library: SHMEM
Parallel Algorithm: swtrans
Number of Timesteps: 12
Results:

16x1 Processors / Problem T10L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.0311e-01   0.07   0.05   0.18 
Three Fastest
Protocols
1st2nd3rd
  a2   c2   d6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  1   6   12 

32x1 Processors / Problem T10L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.0069e-01   0.06   0.07   0.15 
Three Fastest
Protocols
1st2nd3rd
  a2   e2   d6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  2   6   12 

8x1 Processors / Problem T21L8
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  7.7707e-01   0.04   0.02   0.14 
Three Fastest
Protocols
1st2nd3rd
  a2   c2   e2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  3   7   12 

16x1 Processors / Problem T21L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  8.4466e-01   0.05   0.04   0.14 
Three Fastest
Protocols
1st2nd3rd
  a2   c2   d6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  2   7   12 

32x1 Processors / Problem T21L32
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  9.0662e-01   0.05   0.03   0.14 
Three Fastest
Protocols
1st2nd3rd
  a2   c2   d6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  2   7   12 


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:24:11 EDT.