PSTSWM SPP-2000 Point-to-Point Communication Performance

Performance Studies using PSTSWM


HP/Convex SPP-2000 SWAP Performance

(ordered swap of 2MB message using MPI)

Date/Person: April 26, 1998 / P. Worley
Platform: HP/Convex SPP-2000 at the National Center for Supercomputing Applications (billie.ncsa.uiuc.edu):
   64 180-MHz HP PA-RISC 8000 processors (4 Hypernodes)
Environment: SPP-UX_ail 5.2.1 L34
   SPP-UX_mk 5.2.1.139
   HP MPI 01.03.01.00 (11/21/97) B5880 - SPP-UX 5.x
Communication Library: MPI
SWAP size: 262144 REAL*8 floating point values each direction
Message size:
   Largest - 262144 REAL*8 floating point values
   Smallest - 256 REAL*8 floating point values
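
The message sizes above are given as counts of REAL*8 values, while the page heading quotes a 2MB message. A minimal sketch of the conversion, assuming 8 bytes per REAL*8 value and reading "MB" as 2**20 bytes (the helper name message_bytes is hypothetical, not part of PSTSWM):

```python
BYTES_PER_REAL8 = 8  # assumption: a Fortran REAL*8 value occupies 8 bytes


def message_bytes(n_values):
    """Return the size in bytes of a message holding n_values REAL*8 values."""
    return n_values * BYTES_PER_REAL8


largest = message_bytes(262144)  # largest message listed above
smallest = message_bytes(256)    # smallest message listed above

print(largest, largest / 2**20)  # 2097152 bytes, i.e. 2.0 MB with MB = 2**20 bytes
print(smallest)                  # 2048 bytes
```

Under these assumptions the largest message matches the 2MB swap size quoted in the heading.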
Processors: 0 and 1
Latency Definition: (T1024-T512)/512
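
The latency definition is a difference of two timings divided by 512. The page does not say what T1024 and T512 measure or in what units, so the sketch below only evaluates the expression as written; the function name and the sample timings are hypothetical and are not measured values.

```python
def latency_estimate(t_1024, t_512):
    """Evaluate (T1024 - T512) / 512 exactly as the definition is written.

    Assumption: t_1024 and t_512 are the two timings the page calls
    T1024 and T512, expressed in the same unit.
    """
    return (t_1024 - t_512) / 512


# Hypothetical timings, chosen only to show the arithmetic.
example = latency_estimate(30.0, 20.0)
print(example)
```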
Results:

ordered simple swap
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               540.64                     20.19               43.8%
10 iter.              826.53                     18.83               56.6%
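
The page does not define "ordered swap". As an illustration only, the sketch below assumes it means that one processor sends and then receives while its partner receives and then sends. It imitates that pattern with two Python threads and queues rather than MPI, so it shows the ordering of operations and says nothing about performance. All names here are hypothetical.

```python
import queue
import threading


def ordered_swap(rank, outbox, inbox, payload):
    """Exchange payload with the partner in a fixed order.

    Assumption: rank 0 sends first and then receives; rank 1 receives
    first and then sends.
    """
    if rank == 0:
        outbox.put(payload)   # send first
        return inbox.get()    # then receive
    received = inbox.get()    # receive first
    outbox.put(payload)       # then send
    return received


def run_swap(n_values=256):
    """Run one swap between two simulated processors, 0 and 1."""
    to_rank1 = queue.Queue()  # messages travelling from rank 0 to rank 1
    to_rank0 = queue.Queue()  # messages travelling from rank 1 to rank 0
    data = {0: [0.0] * n_values, 1: [1.0] * n_values}
    results = {}

    def worker(rank, outbox, inbox):
        results[rank] = ordered_swap(rank, outbox, inbox, data[rank])

    threads = [
        threading.Thread(target=worker, args=(0, to_rank1, to_rank0)),
        threading.Thread(target=worker, args=(1, to_rank0, to_rank1)),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


swapped = run_swap(4)
print(swapped[0], swapped[1])
```

After the swap each side holds the other side's buffer, which is the property the timed exchange relies on.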

ordered swap using nonblocking send
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               445.92                     24.52               41.3%
10 iter.              575.97                     22.43               55.9%
1 iter. w/overlap     365.74                     26.39               45.0%
10 iter. w/overlap    440.48                     24.46               55.7%

ordered swap using nonblocking receive
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               443.42                     24.75               41.6%
10 iter.              576.76                     24.30               55.4%
1 iter. w/overlap     439.01                     26.31               48.9%
10 iter. w/overlap    450.51                     23.60               56.5%

ordered swap using nonblocking send and receive
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               447.58                     24.75               42.1%
10 iter.              576.47                     24.10               55.5%
1 iter. w/overlap     364.66                     31.92               39.3%
10 iter. w/overlap    434.48                     25.20               55.1%

ordered swap using ready send
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               446.92                     31.85               39.8%
10 iter.              564.50                     30.83               52.8%
1 iter. w/overlap     438.78                     26.54               48.7%
10 iter. w/overlap    448.21                     23.37               56.2%

ordered swap using nonblocking ready send
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               445.49                     32.14               39.5%
10 iter.              575.37                     29.67               53.8%
1 iter. w/overlap     366.19                     28.84               41.0%
10 iter. w/overlap    440.66                     25.66               55.1%

synchronous
Data                  Statistics
                      unidirectional bandwidth   estimated latency   model error
                      (peak MByte/sec)           (usec/msg)          (max. rel. error)
1 iter.               524.74                     21.01               42.5%
10 iter.              801.73                     21.12               54.9%

DISCUSSION


Patrick H. Worley (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:22:49 EDT.