|
|
 |
|
 |
Volume 34, Number 2, 1995
Scalable Parallel Computing |
|
Table of contents: HTML |
|
|
Copyright info |
 |
 |
 |
 |
| |
|
A scalable implementation of the NAS Parallel Benchmark BT on distributed memory systems |
 |
by V. K. Naik |
 |
 |
 |
 |
|
In this paper, we describe an efficient and scalable implementation
of the NAS Parallel Benchmark BT suitable for distributed memory
systems such as the IBM Scalable POWERparallel Systems®.
After
describing the parallelization and data partitioning methods used, we
outline some of the optimization steps used to realize good performance
on individual processors and to reduce the communication overheads on
the IBM SP1 and SP2 systems. We
present performance results on
up to 128 nodes of the SP1, and on the SP2 with wide nodes. We describe
the performance on the standard Class A and Class B problem sets.
To show the scalability of our parallelization methods, we present the
performance of two additional data sets.
|
 |
 |
|
|