PSTSWM T3D Algorithm Comparison

Performance Studies using

PSTSWM


SGI/Cray Research T3D Algorithm Comparison

(distributed LT experiment II-A )

Date/Person: October 28, 1994 / P. Worley
Platform: T3D at Cray Research in Eagen, MN (rain):
   128 150-MHz DEC Alpha EV4 RISC processors
Environment: IS-9.0
Cray CF90 Version 0.1.2.3
Code Version: 6.1
Compilation Options: /mpp/bin/f90 -dp -Oscalar3
Math Library: none
Communication Library: SHMEM
Number of Timesteps: 12
Results:

Distributed LT (2) (shmem)
Algorithm Comparison
  T42L1     T21L2     T42L2     T85L2     T85L1  
  P=32     P=16     P=8     P=32     P=16  
  optimal algorithm   halfsum  halfsum  halfsum  halfsum  halfsum 
  (generic-min)/min     0.186    0.112    0.024    0.042    0.017 


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:23:55 EDT.