Method for ensuring operation during node failures and...

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06877107

ABSTRACT:
A means for guaranteeing the proper behavior as specified by the JMS semantics of clustered message server when the individual computer that comprise the cluster are separated by a network partition. A clustered message server is responsible for the reliable transportation of messages between different distributed computer applications. It employs multiple computers to perform a function that otherwise appears to be performed by a monolithic server running on one computer, but with more capacity and reliability than can be provided by one computer. If a computer in the cluster fails, another computer should automatically assume the role of the failed computer. However, it is not possible for the other machines in the cluster to detect the difference between the failure of one or more computers in the cluster, and the failure of data network connecting those computers. In ordinary clusters, different actions would be required in these two cases, but since they are impossible to distinguish, computer failure is always assumed and network failure is ignored and the consequence non-deterministic. The invention described here provides a means of responding to failures that yields correct behavior as specified by the JMS semantics whether the failure is due to computer failure or network failure.

REFERENCES:
patent: 6449734 (2002-09-01), Shrivastava et al.
patent: 6785678 (2004-08-01), Price
patent: 0 853 277 (1998-07-01), None
Scalability of the Microsoft Cluster Service, Werner Vogels, Dan Dumitriu, Ashutosh Agrawal, Teck Chia, Katherine Guo, Department of Computer Science, Cornell University.
The Design and Architecture of the Microsoft Cluster Service, A Practical Approach to High-Availability and Sealability, Werner Vogels, Dan Dumitriu, Ken Birman (Dept. of Computer Science Cornell University).
An Overview of the Galaxy Management Framework for Scalable Enterprise Cluster Computing, Werner Vogels, Dan Dumitriu, Dept. of Computer Science, Cornell University.
Six Misconceptions about Reliable Distributed Computing, Werner Vogels, Robert van Renesse and Ken Birman, Dept. of Computer Science, Cornell University.
Dynamic Routing in SonicMQ 3.0.
A dynamically fault-tolerant and dynamically scalable distributed tuplespace for heterogeneous, loosely coupled networds, A thesis submitted in the partial fulfilment of the requirements for the degree of Candidatus Scientiarum in Computer Science by Jesper Honig Spring.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for ensuring operation during node failures and... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for ensuring operation during node failures and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for ensuring operation during node failures and... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3389586

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.