PSTSWM Origin2000 Point-to-Point Communication Performance

Performance Studies using PSTSWM


SGI Origin2000 SWAP Performance

(ordered swap of 8KB message using SHMEM)

Date/Person: September 30, 1999 / P. Worley
Platform: SGI Origin2000 at Los Alamos National Laboratory:
   128 250-MHz MIPS R10000 processors
Environment: IRIX 6.5
   mpt_1.3.0.0
MIPSpro Compilers: Version 7.2.1
Communication Library: SHMEM
SWAP size: 1024 REAL*8 floating-point values in each direction
Message size: Largest - 1024 REAL*8 floating-point values
   Smallest - 1 REAL*8 floating-point value
Processors: 0 and 1
Latency Definition: (T1024 - T512)/512
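
The sizes and the latency definition above can be checked with a short Python sketch. It only evaluates the arithmetic as stated: 1024 REAL*8 values at 8 bytes each give the 8KB swap size, and the latency is (T1024 - T512)/512. The page does not say exactly how T1024 and T512 are measured, so the function simply takes them as inputs. The names message_bytes and estimated_latency are hypothetical, and the timings in the usage lines are made-up values chosen for easy arithmetic, not measurements.

```python
REAL8_BYTES = 8  # a Fortran REAL*8 value occupies 8 bytes


def message_bytes(n_values: int) -> int:
    """Size in bytes of a message holding n_values REAL*8 values."""
    return n_values * REAL8_BYTES


def estimated_latency(t1024: float, t512: float) -> float:
    """Evaluate the stated latency definition, (T1024 - T512) / 512.

    The result is in the same time unit as the two inputs.
    """
    return (t1024 - t512) / 512.0


if __name__ == "__main__":
    print(message_bytes(1024))  # 8192 bytes, the 8KB swap size
    print(message_bytes(1))     # 8 bytes, the smallest message
    # Hypothetical timings (not measurements), chosen for easy checking.
    print(estimated_latency(3072.0, 1024.0))  # 4.0
```

If the timings are supplied in microseconds, the result is in microseconds as well, matching the usec/msg unit used in the results.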
Results:

ordered swap using nonblocking send
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               68.96                      4.49                8.8%
10 iter.              341.33                     4.47                18.6%
1 iter. w/overlap     58.85                      5.03                20.4%
10 iter. w/overlap    329.26                     3.85                15.5%

ordered swap using nonblocking receive
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               82.58                      4.61                10.3%
10 iter.              371.01                     4.48                20.3%
1 iter. w/overlap     76.99                      4.58                8.4%
10 iter. w/overlap    339.07                     2.93                24.3%

ordered swap using nonblocking send and receive
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               68.27                      4.32                8.5%
10 iter.              337.40                     4.31                17.7%
1 iter. w/overlap     60.77                      4.26                11.0%
10 iter. w/overlap    323.54                     4.37                17.3%

ordered swap using ready send
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               81.59                      4.37                19.3%
10 iter.              369.68                     4.40                19.9%
1 iter. w/overlap     71.86                      4.38                14.8%
10 iter. w/overlap    351.29                     4.51                19.3%
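
For readers who want to compare the four variants side by side, the tables above can be loaded into a small Python structure. This is only a convenience sketch: the numbers are copied from the tables, while the names RESULTS, best_bandwidth, and bandwidth_ratio are hypothetical and not part of PSTSWM.

```python
# Rows copied from the four tables above. Each tuple is
# (peak MByte/sec, estimated latency in usec/msg, max. rel. model error in %).
RESULTS = {
    "nonblocking send": {
        "1 iter.": (68.96, 4.49, 8.8),
        "10 iter.": (341.33, 4.47, 18.6),
        "1 iter. w/overlap": (58.85, 5.03, 20.4),
        "10 iter. w/overlap": (329.26, 3.85, 15.5),
    },
    "nonblocking receive": {
        "1 iter.": (82.58, 4.61, 10.3),
        "10 iter.": (371.01, 4.48, 20.3),
        "1 iter. w/overlap": (76.99, 4.58, 8.4),
        "10 iter. w/overlap": (339.07, 2.93, 24.3),
    },
    "nonblocking send and receive": {
        "1 iter.": (68.27, 4.32, 8.5),
        "10 iter.": (337.40, 4.31, 17.7),
        "1 iter. w/overlap": (60.77, 4.26, 11.0),
        "10 iter. w/overlap": (323.54, 4.37, 17.3),
    },
    "ready send": {
        "1 iter.": (81.59, 4.37, 19.3),
        "10 iter.": (369.68, 4.40, 19.9),
        "1 iter. w/overlap": (71.86, 4.38, 14.8),
        "10 iter. w/overlap": (351.29, 4.51, 19.3),
    },
}


def best_bandwidth(case: str) -> str:
    """Name of the variant with the highest peak bandwidth for a table row."""
    return max(RESULTS, key=lambda variant: RESULTS[variant][case][0])


def bandwidth_ratio(variant: str, overlap: bool = False) -> float:
    """Ratio of the 10 iter. peak bandwidth to the 1 iter. peak bandwidth."""
    suffix = " w/overlap" if overlap else ""
    one_iter = RESULTS[variant]["1 iter." + suffix][0]
    ten_iter = RESULTS[variant]["10 iter." + suffix][0]
    return ten_iter / one_iter


if __name__ == "__main__":
    for case in ("1 iter.", "10 iter.", "1 iter. w/overlap", "10 iter. w/overlap"):
        print(case, "->", best_bandwidth(case))
    for variant in RESULTS:
        print(variant, round(bandwidth_ratio(variant), 2))
```

The sketch reports which variant has the largest peak bandwidth in each row and how much larger the 10 iter. figure is than the 1 iter. figure. It does not attempt to explain the differences.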

DISCUSSION


Patrick H. Worley (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:07:40 EDT.