Method of preventing false or unnecessary failovers in a...

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06389551

ABSTRACT:

The present invention relates to a system and method for providing a quorum service for high availability clusters to prevent false or unnecessary failovers.
BACKGROUND OF THE INVENTION
Multi-processing systems are commonly configured in clusters of related systems to ensure high availability. These high availability clusters typically require the configuration of one or more heartbeats or communication paths between or among systems. A “heartbeat” is meant to include any type of brief, periodic communication signal which systems send to each other to insure that all systems in the cluster are functional. (Typically, a main system sends a message and the other systems repeat the message back to the main system to check system operability in the cluster.) The failure of all heartbeat mechanisms to a given system indicates that the system is dead (not functioning properly) and at least one of the remaining systems in the cluster needs to initiate a failover of any applications which were running on the affected system.
However, failed cables, failed routers, etc. can give the appearance that a system is not functioning properly when the system is actually still alive and functioning. Since the systems on one side of the network (i.e. one side of the failed cables) cannot communicate with the systems on the other side of the network, false failovers occur. This leads to the possibility of the same application being run on two or more different systems in the network, which can lead to data corruption. The possibility of the same application being run on two or more different systems in the network is especially high in cluster configurations in which there are no redundant heartbeat mechanisms, as well as in wide-area failover or disaster recovery configurations.
There is a need for a system and method which enables each system of a cluster to register with a quorum service which can assist in determining whether a failover is required.
SUMMARY OF THE INVENTION
In accordance with the teachings of the present invention, a system and method of providing a quorum service which each system of a cluster registers with prior to a potential failover to insure proper functionality of the cluster is provided. In particular, a method of preventing false or unnecessary failovers in a high availability cluster due to network failures, wherein said high availability cluster includes a plurality of systems, comprising the steps of providing a quorum service which each of said systems can independently communicate with; sending a registration signal from each system indicating that the system is operational when the failure of any system in the cluster is suspected; initiating shutdown procedures at a particular system if the particular system is unable to send a registration signal to said quorum service; requesting registration status by one or more of the systems other than the particular system that is unable to send a registration signal to said quorum service; and proceeding with failover activities by at least one of the systems other than the particular system that is unable to send a registration signal to said quorum service is provided.


REFERENCES:
patent: 4710926 (1987-12-01), Brown et al.
patent: 4754398 (1988-06-01), Pribnow
patent: 4817091 (1989-03-01), Katzman et al.
patent: 4847837 (1989-07-01), Morales et al.
patent: 4933940 (1990-06-01), Walter et al.
patent: 5383178 (1995-01-01), Unverrich
patent: 5475813 (1995-12-01), Cieslak et al.
patent: 5487148 (1996-01-01), Komori et al.
patent: 5533191 (1996-07-01), Nakano
patent: 5682470 (1997-10-01), Dwork et al.
patent: 5892895 (1999-04-01), Basavaiah
patent: 5909540 (1999-06-01), Carter et al.
patent: 6002851 (1999-12-01), Basavaiah
patent: 6067634 (2000-05-01), Nelson
patent: 6108699 (2000-08-01), Moiin
patent: 6145089 (2000-11-01), Le et al.
patent: 6148410 (2000-11-01), Baskey et al.
patent: 6237113 (2001-05-01), Daiber
patent: 6253335 (2001-06-01), Nakajima et al.
patent: 6314526 (2001-11-01), Arendt et al.
patent: 6327675 (2001-12-01), Burdett et al.
patent: 6330687 (2001-12-01), Griffith
patent: 6330694 (2001-12-01), Hong et al.
patent: 6332202 (2001-12-01), Sheikh et al.
patent: 4-318721 (1991-04-01), None

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of preventing false or unnecessary failovers in a... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of preventing false or unnecessary failovers in a..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of preventing false or unnecessary failovers in a... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2902393

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.