PSTSWM Parallel Performance

Performance Studies using

PSTSWM


Experiment A-1 Transpose LT Performance

These results represent the minimum execution time for each platform, using the optimal parallel implementation, math libraries where available, and the best available compilers. The legends list the platforms in alphabetical order. Results are presented for both MPI-only experiments and best over all communication libraries.

Transpose LT (1) (mpi)
Platform Comparison
  T10L16    T21L8     T21L32    T21L16    T42L16 
  P=16     P=8     P=32     P=16     P=8  
  best platform   sp3-200-nn  sp3-200-nn  sp3-200-nn  sp3-200-nn  sp3-200-nn 
  best timing   .54357E-01  .11385E+00  .17409E+00  .13401E+00  .93774E+00 

Transpose LT (1) (all)
Platform Comparison
  T10L16    T21L8     T21L32    T21L16    T42L16 
  P=16     P=8     P=32     P=16     P=8  
  best platform   t3e900  sp3-200-nn  t3e900  sp3-200-nn  sp3-200-nn 
  best library   shmem  mpi  shmem  mpi  mpi 
  best timing   0.42648E-01  .11385E+00  0.17199E+00  .13401E+00  .93774E+00 


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 09:58:56 EDT.
82194 accesses since 1/2/96.