PSTSWM Origin2000 Point-to-Point Communication Performance

Performance Studies using

PSTSWM


SGI Origin2000 SWAP Performance

(unordered swap of 2MB message using SHMEM)

(performance measured per processor when all processors in node communicating)

Date/Person: September 30, 1999 / P. Worley
Platform: SGI Origin2000 at Los Alamos National Laboratory:
   128 250-MHz MIPS R10000 processors
Environment: IRIX 6.5
mpt_1.3.0.0
MIPSpro Compilers: Version 7.2.1
Communication Library: SHMEM
SWAP size: 262144 REAL*8 floating point values each direction
Message size: Largest - 262144 REAL*8 floating point values
Smallest - 256 REAL*8 floating point values
Processors: 0 and 2
1 and 3
Latency Definition:(T1024-T512)/512
Results:

unordered swap using nonblocking send
Data Statistics
bidirectional bandwidth estimated latency model error
(peak MByte/sec) (usec/msg) (max. rel. error)
1 iter. 189.16 7.04 1.4%
10 iter. 842.76 8.17 21.6%
1 iter. w/overlap 194.26 6.72 3.2%
10 iter. w/overlap 856.02 6.07 19.0%

unordered swap using nonblocking receive
Data Statistics
bidirectional bandwidth estimated latency model error
(peak MByte/sec) (usec/msg) (max. rel. error)
1 iter. 203.31 7.45 3.5%
10 iter. 824.73 8.31 22.3%
1 iter. w/overlap 202.81 8.48 3.2%
10 iter. w/overlap 915.69 4.81 20.0%

DISCUSSION


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:07:41 EDT.
86318 accesses since 1/2/96.