IBM Skip to main content
  Home     Products & services     Support & downloads     My account  
  Select a country  
Journals Home  
  Systems Journal  
  ·  Current Issue  
  ·  Recent Issues  
  ·  Papers in Progress  
  ·  Search/Index  
  ·  Orders  
  ·  Description  
  ·  Author's Guide  
Journal of Research
and Development
  Staff  
  Contact Us  
Systems Journal  
Volume 34, Number 2, 1995
Scalable Parallel Computing
 Table of contents: arrowHTML      DOI: 10.1147/sj.342.0144 arrowCopyright info
   

Preface

The growth of technical and commercial applications for high-performance computing, combined with the decline in the price of computers overall and the need to deliver new capabilities quickly, has led to the emergence of parallel computers and parallel computing. The newest of the desired applications include digital libraries, transaction processing, decision support, advanced multimedia, high-bandwidth networks, and data mining. Current high-performance applications from business and science also need a path to effective use of parallel computing. IBM has addressed this broad new territory with solutions that are general purpose and scalable and that operate readily within existing technical and commercial computing environments. These solutions are built on the firm foundation of RISC System/6000* workstation microprocessors with the AIX* operating system or on the well-established System/390* enterprise servers. The former solution--the IBM Scalable POWERparallel Systems*--is the focus of this special issue.

The availability of this new family of parallel systems has led to the resurgence of existing applications that previously ran only on mainframes and to the development of new applications in science and business that were not practical before. This issue contains an introductory essay and eight papers on the hardware, software, and systems that comprise the POWERparallel* SP1* and SP2* environments. We are indebted to Z. Barzilai and J. Moulic of the Thomas J. Watson Research Center, IBM Research Division, in Yorktown Heights, New York, for their efforts in laying the groundwork for this issue. We are also indebted to M. Snir and G. S. Almasi, also of the Thomas J. Watson Research Center, for their coordination and development of these papers.

The creation of new technologies, approaches, systems, and user opportunities is a mix of technical advances and customer needs, some of which are available or anticipated and some not. In an essay intended to serve as an introduction, Wladawsky-Berger relates the recent history of the technology and user needs that made the IBM Scalable POWERparallel Systems both practical for IBM to build on a short schedule and advantageous for customers to have. Fundamentally, the approach augments the family of systems based on RISC System/6000 and AIX, so as to build on and add to the powerful technologies already developed and provide a scalable suite of systems solutions for customers, whether or not they already use those technologies. (Reprint Order No. G321-5562)

The technology and architecture that form the more recent and advanced SP2 scalable parallel system are described in a paper by Agerwala et al. Seven principles for development of the SP2 are presented as a focus for understanding the evolution from scientifically oriented parallel systems to ones that deal effectively with both scientific and commercial applications. The key systems alternatives, competitive choices, SP2 choices, and rationale are explained. The authors also provide a system overview, explain the system components, and briefly discuss system performance using wellknown benchmarks. (Reprint Order No. G321-5563)

In order to bind a collection of processors that share work and communicate effectively at high speeds, there must be a high-performance switch. As parallel systems have become more powerful and flexible, such switches have evolved into sophisticated components, with high bandwidth and reliability concerns of their own, plus responsibility for relieving the processors of as many communication tasks as possible. The paper by Stunkel et al. presents the SP2 High-Performance Switch and its architecture, topology, adapter approach, performance, and support software. (Reprint Order No. G321-5564)

The effective use of parallel computing systems depends on software that takes advantage of the multiple, communicating processors. Snir et al. describe the software development and execution capabilities created for the SP2 that allow programmers to exploit the power and flexibility of the parallel programming model with minimal or no code dependence on the specific SP2 system used for execution. In particular, the authors present the new Message-Passing Library that supports the communications paradigm of the SP2 and the new IBM Parallel Operating Environment that supports parallel program development and execution on the SP2. (Reprint Order No. G321-5565)

Software developers for parallel computing also require a file system that allows effective use of resources in parallel and provides file services for the entire parallel system. The file system for the SP family, known as the AIX Parallel I/O File System, provides parallel access to partitioned files through portability, large file support, ease of use, and reliability. The authors, Corbett et al., provide the technical rationale and the developmental history for this file system, largely built on the Vesta file system project. (Reprint Order No. G321-5566)

Gropp and Lusk relate their experiences migrating to and using an early 128-node SP1 at the Argonne National Laboratory and the subsequent migration to SP2 software running on SP1 nodes. Their conclusions provide useful information for other users and feedback for SP developers. They stress the speed with which the migrations can be accomplished--in days--and the effective coexistence of high performance, parallelism, and portability. (Reprint Order No. G321-5567)

The next two papers focus on the effectiveness of the SP2 as demonstrated through implementations of the Numerical Aerodynamic Simulation (NAS) benchmarks. These benchmarks were developed by the National Aeronautics and Space Administration (NASA) specifically to test parallel supercomputing environments. In the first paper, Agarwal et al. show the parallel implementation of the five NAS kernel benchmarks. The results are compared with those of three other scalable parallel systems, revealing that the SP2 outperforms the others, often by wide margins. (Reprint Order No. G321-5568)

In the second paper on NAS benchmarks for the SP2, Naik describes the parallel implementation of the NAS Parallel Benchmark BT (Block Tridiagonal), which is designed to test the effectiveness of distributed memory systems. These tests, compared with those for two other systems, show results at the top of the range and almost always exceeding results for the others. They also show the delivered power in various situations, measured in billions of floating-point instructions per second (GFLOPS). (Reprint Order No. G321-5569)

One aspect of high-performance systems that is especially important for commercial applications is the availability of equally high-performance database management systems. In this paper, Baru et al. describe DB2* Parallel Edition, the DB2 product that answers this requirement, executing all vital functions in parallel. The architecture and execution model of DB2 Parallel Edition provide processor and disk storage scalability, large database capacity, query optimization, execution time optimization, favorable transaction properties, parallel utilities, and reorganization for load balancing. Performance figures are shown for various complex queries within SP2 operating environments. (Reprint Order No. G321-5570)

We are pleased to announce that, as a service to authors, we now provide an informational page for authors just ahead of the inside back cover. Also, as a service to readers, we recently expanded and reorganized our information about subscriptions and electronic access, and moved it to the inside back cover.

The next issue of the Journal will be a special issue on computer networking.

Gene F. Hoffnagle
Editor

*Trademark or registered trademark of International Business Machines Corporation.

(C) Copyright 1995 by International Business Machines Corporation.