PSTSWM Origin2000 Point-to-Point Communication Performance

Performance Studies using

PSTSWM


SGI Origin2000 SWAP Performance

(unordered swap of 8KB message using SHMEM)

(performance measured per processor when all processors in node communicating)

Date/Person: September 30, 1999 / P. Worley
Platform: SGI Origin2000 at Los Alamos National Laboratory:
   128 250-MHz MIPS R10000 processors
Environment: IRIX 6.5
mpt_1.3.0.0
MIPSpro Compilers: Version 7.2.1
Communication Library: SHMEM
SWAP size: 1024 REAL*8 floating point values each direction
Message size: Largest - 1024 REAL*8 floating point values
Smallest - 1 REAL*8 floating point values
Processors: 0 and 2
1 and 3
Latency Definition:(T1024-T512)/512
Results:

unordered swap using nonblocking send
Data Statistics
bidirectional bandwidth estimated latency model error
(peak MByte/sec) (usec/msg) (max. rel. error)
1 iter. 82.91 8.52 7.3%
10 iter. 464.40 8.17 23.2%
1 iter. w/overlap 72.62 8.10 26.0%
10 iter. w/overlap 464.40 6.19 17.5%

unordered swap using nonblocking receive
Data Statistics
bidirectional bandwidth estimated latency model error
(peak MByte/sec) (usec/msg) (max. rel. error)
1 iter. 93.94 8.46 11.2%
10 iter. 493.49 8.23 24.8%
1 iter. w/overlap 88.28 8.20 15.1%
10 iter. w/overlap 544.68 4.60 15.3%

DISCUSSION


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:07:44 EDT.
86350 accesses since 1/2/96.