PSTSWM SP3-375 Serial Performance

Performance Studies using

PSTSWM


IBM SP3-375 Winterhawk II Serial Performance

(comparing performance when running multiple instances)

(fixed problem size per processor)

Date/Person: February 18, 2000 / P. Worley
Platform: IBM SP3 at Oak Ridge National Laboratory (eagle.ccs.anl.gov):
     62 4-way Winterhawk II SMP nodes (375 MHz POWER3 with 8MB L2 cache)
Environment: AIX 4.3.3;   PSSP 3.1.1
Code Version: 6.7
Make Options: MACH=sp COMM=mpi PRECISION=8 PERF=n WORKSPACE=20000000 MATH=essl
Compilation Options: mpxlf_r -O3 -qarch=auto -qtune=auto -qcache=auto
Link Options: -bmaxdata:0x70000000
Number of steps: T42: 241 or 481
T85: 49 or 97
T170: 49 or 97
Number of processors per node: 1, 2, 3, or 4
Notes: using ESSL library routines for Fourier transforms and BLAS (-lessl)

T42 MFlop rates

Per Processor Rate

Per Node Rate


T85 MFlop rates

Per Processor Rate

Per Node Rate


T170 MFlop rates

Per Processor Rate

Per Node Rate


DISCUSSION


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:19:35 EDT.
3265 accesses since 1/2/96.