Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
2005-04-05
2005-04-05
Bonzo, Bryce P. (Department: 2114)
active
06877107
ABSTRACT:
A means for guaranteeing the proper behavior, as specified by the JMS semantics, of a clustered message server when the individual computers that comprise the cluster are separated by a network partition. A clustered message server is responsible for the reliable transportation of messages between different distributed computer applications. It employs multiple computers to perform a function that otherwise appears to be performed by a monolithic server running on one computer, but with more capacity and reliability than can be provided by one computer. If a computer in the cluster fails, another computer should automatically assume the role of the failed computer. However, it is not possible for the other machines in the cluster to detect the difference between the failure of one or more computers in the cluster and the failure of the data network connecting those computers. In ordinary clusters, different actions would be required in these two cases, but since they are impossible to distinguish, computer failure is always assumed and network failure is ignored, and the consequences are non-deterministic. The invention described here provides a means of responding to failures that yields correct behavior as specified by the JMS semantics whether the failure is due to computer failure or network failure.
REFERENCES:
patent: 6449734 (2002-09-01), Shrivastava et al.
patent: 6785678 (2004-08-01), Price
patent: 0 853 277 (1998-07-01), None
Scalability of the Microsoft Cluster Service, Werner Vogels, Dan Dumitriu, Ashutosh Agrawal, Teck Chia, Katherine Guo, Department of Computer Science, Cornell University.
The Design and Architecture of the Microsoft Cluster Service, A Practical Approach to High-Availability and Scalability, Werner Vogels, Dan Dumitriu, Ken Birman (Dept. of Computer Science, Cornell University).
An Overview of the Galaxy Management Framework for Scalable Enterprise Cluster Computing, Werner Vogels, Dan Dumitriu, Dept. of Computer Science, Cornell University.
Six Misconceptions about Reliable Distributed Computing, Werner Vogels, Robert van Renesse and Ken Birman, Dept. of Computer Science, Cornell University.
Dynamic Routing in SonicMQ 3.0.
A dynamically fault-tolerant and dynamically scalable distributed tuplespace for heterogeneous, loosely coupled networks, A thesis submitted in partial fulfilment of the requirements for the degree of Candidatus Scientiarum in Computer Science by Jesper Honig Spring.
Giotta Paul
Spring Jesper Honig
Bonzo Bryce P.
Rankin, Hill Porter & Clark LLP
Softwired AG