CCM/MP-2D AlphaSC-667 Parallel Algorithm Comparisons

Performance Studies using CCM/MP-2D


Compaq AlphaServer SC CCM/MP-2D
Parallel Algorithm Comparison

Date/Person: April 24-26, 2000 / P. Worley
Platform: Compaq AlphaServer SC at Oak Ridge National Laboratory (falcon.ccs.ornl.gov):
     64 ES40 4-way SMP nodes (667 MHz Alpha 21264a with 8MB L2 cache)
Environment: Digital UNIX V5.0;   RMS 2.36
Compilation Options: f90 -O4 -assume accuracy_sensitive -math_library fast -align dcommons -align records -arch host -tune host
Math Library: none
Communication Library: MPI
Problem Size: T42L18
Number of Timesteps: 2-720
Results:

2 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  1.3572e+03   t2          1x2            -

4 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  7.7889e+02   t2          1x4            -

8 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  3.8190e+02   d1          1x8            -

16 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  2.7643e+02   d1          1x16           -

32 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  1.2307e+02   d0          2x16           +1

64 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  7.7352e+01   t5          4x16           -1

128 Processors / Problem T42L18
Optimal Algorithm
         min   algorithm   aspect ratio   mapping
  4.8569e+01   t5          8x16           -1
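
Scaling sketch (not part of the original measurements): the short Python script below derives relative speedup and parallel efficiency from the "min" values listed above, using the 2-processor run as the baseline. It assumes that "min" is the minimum measured run time for each processor count; the page does not state the units, so only the ratios are meaningful. The names MIN_BY_PROCS and relative_scaling are hypothetical and chosen only for this illustration.

```python
# Values copied from the "min" column of the tables above.
# Assumption: they are minimum run times; the source does not give units.
MIN_BY_PROCS = {
    2: 1.3572e+03,
    4: 7.7889e+02,
    8: 3.8190e+02,
    16: 2.7643e+02,
    32: 1.2307e+02,
    64: 7.7352e+01,
    128: 4.8569e+01,
}


def relative_scaling(min_by_procs):
    """Return {procs: (speedup, efficiency)} relative to the smallest run.

    speedup    = baseline value / value at this processor count
    efficiency = speedup / (procs / baseline procs)
    """
    base_procs = min(min_by_procs)
    base_value = min_by_procs[base_procs]
    result = {}
    for procs, value in sorted(min_by_procs.items()):
        speedup = base_value / value
        efficiency = speedup / (procs / base_procs)
        result[procs] = (speedup, efficiency)
    return result


if __name__ == "__main__":
    for procs, (speedup, eff) in relative_scaling(MIN_BY_PROCS).items():
        print(f"{procs:4d}  speedup {speedup:6.2f}  efficiency {eff:5.2f}")
```

Because each entry is the best result over different algorithms and aspect ratios, these ratios describe the best achievable configuration at each processor count rather than the scaling of any single algorithm.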


Patrick H. Worley (worleyph@ornl.gov)
Last Modified Monday, 15-Jul-2002 10:06:02 EDT.