PSTSWM T3D Algorithm Comparison

Performance Studies using

PSTSWM


SGI/Cray Research T3D Algorithm Comparison

(transpose FFT experiments A )

Date/Person: October 28, 1994 / P. Worley
Platform: T3D at Cray Research in Eagen, MN (rain):
   128 150-MHz DEC Alpha EV4 RISC processors
Environment: IS-9.0
Cray CF90 Version 0.1.2.3
Code Version: 6.1
Compilation Options: /mpp/bin/f90 -dp -Oscalar3
Math Library: none
Communication Library: SHMEM
(assuming distributed FFT)
Number of Timesteps: 12
Results:

Transpose FFT (shmem)
Algorithm Comparison
  T10L16    T10L16    T21L8     T21L32    T21L16 
  P=32     P=16     P=8     P=32     P=16  
  optimal algorithm   swtrans  swtrans  swtrans  swtrans  swtrans 
  (generic-min)/min     0.112    0.056    0.022    0.039    0.026 


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:24:13 EDT.