PSTSWM T3D Algorithm Comparison

Performance Studies using PSTSWM


SGI/Cray Research T3D Algorithm Comparison

(transpose LT experiments II-A)

Date/Person: October 28, 1994 / P. Worley
Platform: T3D at Cray Research in Eagan, MN (rain):
   128 150-MHz DEC Alpha EV4 RISC processors
Environment: IS-9.0
   Cray CF90 Version 0.1.2.3
Code Version: 6.1
Compilation Options: /mpp/bin/f90 -dp -Oscalar3
Math Library: none
Communication Library: SHMEM
Number of Timesteps: 12
Results:

Transpose LT (2) (shmem)
Algorithm Comparison
                      T42L1    T21L2    T42L2    T85L2    T85L1
                      P=32     P=16     P=8      P=32     P=16
  optimal algorithm   swtrans  swtrans  swtrans  swtrans  swtrans
  (generic-min)/min   0.158    0.077    0.016    0.018    0.015
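
The "(generic-min)/min" row can be read as a relative excess runtime. The sketch below assumes that "generic" is the runtime measured with a generic transpose algorithm and "min" is the smallest runtime among the algorithms compared; the page does not define either term, so this reading is an assumption. The function names are hypothetical, and the ratios are copied from the table above. No raw timings are given here, so none are used.

```python
# Ratios copied from the "(generic-min)/min" row of the table above,
# keyed by (problem size, processor count).
RATIOS = {
    ("T42L1", 32): 0.158,
    ("T21L2", 16): 0.077,
    ("T42L2", 8): 0.016,
    ("T85L2", 32): 0.018,
    ("T85L1", 16): 0.015,
}


def relative_excess(generic_time, min_time):
    """Assumed definition of the reported metric: (generic - min) / min."""
    if min_time <= 0:
        raise ValueError("min_time must be positive")
    return (generic_time - min_time) / min_time


def slowdown_factor(ratio):
    """Generic time as a multiple of the minimum: generic / min = 1 + ratio."""
    return 1.0 + ratio


if __name__ == "__main__":
    for (problem, procs), ratio in RATIOS.items():
        print(
            f"{problem} P={procs}: generic is {100 * ratio:.1f}% slower "
            f"({slowdown_factor(ratio):.3f}x the minimum time)"
        )
```

Under this reading, the T42L1 entry of 0.158 means the generic algorithm takes about 1.158 times as long as the fastest one, while the three entries at or below 0.018 indicate a difference of under 2%.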


Patrick H. Worley / (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:24:15 EDT.