Abstract
A cost-effective way to run a parallel application is to use existing workstations connected by a local area network such as Ethernet or FDDI. In this paper, we present an approach for parallel I/O of multidimensional arrays on small networks of workstations with a shared-media interconnect, using the Panda I/O library. In such an environment, the message passing throughput per node is lower than the throughput obtainable from a fast disk and it is not easy for users to determine the configuration which will yield the best I/O performance. We introduce an I/O strategy that exploits local data to reduce the amount of data that must be shipped across the network, present experimental results, and analyze the results using an analytical performance model and predict the best choice of I/O parameters. Our experiments show that the new strategy results in a factor of 1.2-2.1 speedup in response time compared to the Panda version originally developed for the IBM SP2, depending on the array sizes, distributions and compute and I/O node meshes. Further, the performance model predicts the results within a 13% margin of error.
Original language | English |
---|---|
Pages | 1-13 |
Number of pages | 13 |
DOIs | |
State | Published - 1997 |
Externally published | Yes |
Event | Proceedings of the 1997 5th Workshop on I/O in Parallel and Distributed Systems - San Jose, CA, USA Duration: 17 11 1997 → 17 11 1997 |
Conference
Conference | Proceedings of the 1997 5th Workshop on I/O in Parallel and Distributed Systems |
---|---|
City | San Jose, CA, USA |
Period | 17/11/97 → 17/11/97 |