Abstract
With the great progress of distributed object computing, more and more large systems are built using this technology. Fault tolerance for distributed object computing is therefore a significant research area. The Object Management Group (OMG) recently published the "Fault Tolerant CORBA Specification V1.0". This specification defines how to achieve fault tolerance for distributed object computing using object groups, and failure detection is one of the key elements of fault management. However, the specification says little about failure detection and leaves many specific details to vendors. In this paper, we propose a simple mechanism for failure detection in distributed object computing. The mechanism is designed to be general rather than application-specific, efficient, and free of any single point of failure. Because the failure detectors themselves may crash during operation, we propose a method to handle this condition and preserve the "no single point of failure" property. The proposed mechanism has been implemented using CORBA to demonstrate that it works well.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2001 Pacific Rim International Symposium on Dependable Computing, PRDC 2001 |
| Publisher | IEEE Computer Society |
| Pages | 273-280 |
| Number of pages | 8 |
| ISBN (Electronic) | 0769514146 |
| State | Published - 2001 |
| Externally published | Yes |
| Event | Pacific Rim International Symposium on Dependable Computing, PRDC 2001 - Seoul, Korea, Republic of. Duration: 17/12/2001 → 19/12/2001 |
Publication series
| Name | Proceedings of IEEE Pacific Rim International Symposium on Dependable Computing, PRDC |
|---|---|
| Volume | 2001-January |
| ISSN (Print) | 1541-0110 |
Conference
| Conference | Pacific Rim International Symposium on Dependable Computing, PRDC 2001 |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Seoul |
| Period | 17/12/01 → 19/12/01 |
Bibliographical note
Publisher Copyright: © 2001 IEEE.