Fault tolerance in hyperbus and hypercube multiprocessors using partitioning scheme

Shih Chang Wang*, Sy Yen Kuo

*Corresponding author for this work

Research output: Contribution to conference, Conference Paper, peer-review

2 Scopus citations

Abstract

In this paper, a partitioning scheme is used to achieve fault tolerance in hyperbus and hypercube multiprocessors. Unlike other schemes, ours assumes that processor faults are randomly distributed. We propose a novel and practical load redistribution method that tolerates processor faults in a hyperbus structure with insignificant overhead (in the worst case, a slowdown factor of 2 for computation and 3 for communication). Standard routing and broadcasting algorithms have been implemented on hypercube computers. To achieve fault tolerance, we present routing and broadcasting algorithms for a faulty hypercube with at most n-1 faults. Compared with existing algorithms, our methods perform better on most measures.

Original language: English
Pages: 340-347
Number of pages: 8
State: Published - 1994
Externally published: Yes
Event: Proceedings of the 1994 International Conference on Parallel and Distributed Systems - Hsinchu, China
Duration: 19 Dec 1994 to 21 Dec 1994

Conference

Conference: Proceedings of the 1994 International Conference on Parallel and Distributed Systems
City: Hsinchu, China
Period: 19/12/94 to 21/12/94
