PSTSWM Paragon Communication Protocol Performance

Performance Studies using

PSTSWM


Intel Paragon Protocol Performance

(transpose FFT experiment A2 / O(log P) transpose algorithm)

Date/Person: June 4, 1998 / P. Worley
Platform: Intel Paragon XP/S 150 MP at Oak Ridge National Laboratory:
     1024 MP nodes (3 50-MHz iPSC/860 processors per node)
Environment: Paragon OSF/1 Release 1.0.4 Server 1.4 R1_4_5
f77/Paragon Paragon Version R5.0.3
Code Version: 6.3
Compilation Options: if77 -O4 -Mnodepchk -Knoieee -Msafealloc
Math Library: none
Communication Library: NX
Parallel Algorithm: logtrans
Partition: 4x2, 4x4, or 8x4
Results:

16x1 Processors / Problem T10L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  5.5017e-01   0.02   0.02   0.05 
Three Fastest
Protocols
1st2nd3rd
  b6   d2   d6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  9   23   24 

32x1 Processors / Problem T10L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  7.5374e-01   0.02   0.02   0.03 
Three Fastest
Protocols
1st2nd3rd
  c2   d2   b6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  6   24   24 

8x1 Processors / Problem T21L8
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.7391e+00   0.02   0.02   0.05 
Three Fastest
Protocols
1st2nd3rd
  b5   d6   d4 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  8   23   24 

16x1 Processors / Problem T21L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.9200e+00   0.02   0.02   0.06 
Three Fastest
Protocols
1st2nd3rd
  b5   b4   d2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  8   20   24 

32x1 Processors / Problem T21L32
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.2151e+00   0.02   0.02   0.06 
Three Fastest
Protocols
1st2nd3rd
  d2   d3   b6 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  8   20   24 

8x1 Processors / Problem T42L16
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.6226e+01   0.02   0.02   0.04 
Three Fastest
Protocols
1st2nd3rd
  b5   d6   b4 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  8   24   24 

DISCUSSION

The Paragon processor grid partitions were chosen to match those used for the October, 1994 SUNMOS data.

Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:29:42 EDT.