PSTSWM Paragon Communication Protocol Performance

Performance Studies using

PSTSWM


Intel Paragon Protocol Performance

(transpose LT experiment II-A2 / O(P) sendrecv transpose algorithm)

Date/Person: June 4, 1998 / P. Worley
Platform: Intel Paragon XP/S 150 MP at Oak Ridge National Laboratory:
     1024 MP nodes (3 50-MHz iPSC/860 processors per node)
Environment: Paragon OSF/1 Release 1.0.4 Server 1.4 R1_4_5
f77/Paragon Paragon Version R5.0.3
Code Version: 6.3
Compilation Options: if77 -O4 -Mnodepchk -Knoieee -Msafealloc
Math Library: none
Communication Library: NX
Parallel Algorithm: srtrans
Partition: 4x2, 4x4, or 8x4
Number of Timesteps: 12
Results:

1x16 Processors / Problem T21L2
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.6417e-01   0.12   0.10   0.33 
Three Fastest
Protocols
1st2nd3rd
  c2   e2   e3 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  1   8   25 

1x32 Processors / Problem T42L1
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  3.6447e-01   0.21   0.17   0.57 
Three Fastest
Protocols
1st2nd3rd
  e3   e1   e2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  1   4   19 

1x8 Processors / Problem T42L2
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.6553e+00   0.01   0.01   0.02 
Three Fastest
Protocols
1st2nd3rd
  d2   c2   c3 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  22   29   29 

1x16 Processors / Problem T85L1
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.2553e+00   0.01   0.01   0.04 
Three Fastest
Protocols
1st2nd3rd
  e3   c3   c2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  9   29   29 

1x32 Processors / Problem T85L2
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  2.3546e+00   0.03   0.03   0.09 
Three Fastest
Protocols
1st2nd3rd
  e3   e2   c2 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  3   22   29 

1x8 Processors / Problem T85L4
Runtime Statistics
min(mean-min)/min(median-min)/min(max-min)/min
  1.9400e+01   0.01   0.01   0.02 
Three Fastest
Protocols
1st2nd3rd
  d6   b6   b5 
       Number of Proctocols With
Runtimes Within X% of Min
1%5%25%
  25   29   29 

DISCUSSION

The Paragon processor grid partitions were chosen to match those used for the October, 1994 SUNMOS data.

Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:30:09 EDT.