Abstract
In this paper, the partitioning scheme is used to achieve fault tolerance in hyperbus and hypercube multiprocessors. Unlike other schemes, processor faults are assumed to be randomly distributed. We propose a novel and practical load redistribution method to tolerate processor faults in a hyperbus structure with insignificant overhead(a slowdown of 2 for computation and a slowdown of 3 for communication in the worst case). Standard routing and broadcasting algorithms were implemented on hypercube computers. To achieve fault tolerance, we present routing and broadcasting algorithms for a faulty hypercube with at most n-1 faults. Compared with other existing algorithms, our methods have better performance in most measures.
| Original language | English |
|---|---|
| Pages | 340-347 |
| Number of pages | 8 |
| State | Published - 1994 |
| Externally published | Yes |
| Event | Proceedings of the 1994 International Conference on Parallel and Distributed Systems - Hsinchu, China Duration: 19 12 1994 → 21 12 1994 |
Conference
| Conference | Proceedings of the 1994 International Conference on Parallel and Distributed Systems |
|---|---|
| City | Hsinchu, China |
| Period | 19/12/94 → 21/12/94 |