PSTSWM AlphaSC-667 Serial Performance

Performance Studies using

PSTSWM


Compaq AlphaServer SC Serial Performance

(Using CXML Math Library)

Date/Person: November 15, 2000 / P. Worley
Platform: Compaq AlphaServer SC at Oar Ridge National Laboratory (colt.ccs.ornl.gov):
     16 ES40 4-way SMP nodes (667 MHz Alpha 21264a with 8MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.36
Compaq Fortran V5.3-1120
Code Version: 6.7.2
Make Options: MACH=alpha-sc COMM=mpi PRECISION=8 PERF=n WORKSPACE=22000000 MATH=cxml
Compilation Options: f90 -O4 -assume accuracy_sensitive -math_library accurate -align dcommons -align records -arch host -tune host
    or f90 -O4 -assume noaccuracy_sensitive -math_library fast -align dcommons -align records -arch host -tune host
    or f90 -O5 -assume noaccuracy_sensitive -math_library fast -align dcommons -align records -arch host -tune host
Number of steps: T5, T10, T21, T42: 241 or 481
T85: 49 or 97
T170: 49 or 97
Notes: using CXML library routines for Fourier transforms and BLAS

-O4 accurate

MEASURED TIME PER TIMESTEP (SEC)

Problem L1 L2 L3 L16
T5 0.00015 0.00027 0.00041 0.00225
T10 0.00039 0.00079 0.00119 0.00701
T21 0.00170 0.00351 0.00533 0.03211
T42 0.00819 0.01586 0.02741 0.17412
T85 0.04709 0.10235 0.15335 1.12309
T170 0.32361 0.75606 0.99520  

MEASURED MFLOP/SEC RATES

Problem L1 L2 L3 L16
T5 210.8 233.5 229.8 223.2
T10 380.9 375.5 372.7 338.5
T21 451.0 435.8 430.9 381.1
T42 493.3 509.7 442.2 371.3
T85 505.2 464.9 465.5 339.0
T170 465.1 398.1 453.7  

-O4 fast

MEASURED TIME PER TIMESTEP (SEC)

Problem L1 L2 L3 L16
T5 0.00013 0.00027 0.00039 0.00219
T10 0.00040 0.00079 0.00120 0.00707
T21 0.00171 0.00354 0.00538 0.03212
T42 0.00827 0.01593 0.02751 0.17503
T85 0.04735 0.10340 0.15427 1.13454
T170 0.32510 0.75784 0.99955  

MEASURED MFLOP/SEC RATES

Problem L1 L2 L3 L16
T5 233.0 233.3 242.7 229.8
T10 374.9 373.1 371.3 335.6
T21 447.6 432.7 426.5 381.0
T42 488.4 507.4 440.6 369.4
T85 502.4 460.2 462.7 335.5
T170 463.0 397.2 451.7  

-O5 fast

MEASURED TIME PER TIMESTEP (SEC)

Problem L1 L2 L3 L16
T5 0.00015 0.00027 0.00041 0.00224
T10 0.00042 0.00083 0.00125 0.00723
T21 0.00175 0.00358 0.00551 0.03330
T42 0.00820 0.01619 0.02720 0.24963
T85 0.04743 0.10153 0.15200 1.68304
T170 0.32535 0.75561 0.99974  

MEASURED MFLOP/SEC RATES

Problem L1 L2 L3 L16
T5 211.7 232.0 229.3 224.8
T10 353.6 355.3 355.2 328.3
T21 437.7 427.0 416.1 367.5
T42 492.5 499.3 445.7 259.0
T85 501.6 468.7 469.6 226.2
T170 462.6 398.4 451.6  

DISCUSSION


Patrick H. Worley / ( worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:05:47 EDT.
3339 accesses since 1/2/96.