Method for monitoring and recovery of subsystems in a distribute

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

39518218, G06F 1100

Patent

active

058057854

ABSTRACT:
A system and method for a general and extensible infrastructure providing monitoring and recovery of interdependent systems in a distributed/clustered system is disclosed. Subsystems, built without provision for high availability, are incorporated into the infrastructure without modification to core subsystem function. The infrastructure is comprised of one or more computing nodes connected by one or more interconnection networks, and running one or more distributed subsystems. The infrastructure monitors the computing nodes using one or more heartbeat and membership protocols, and monitors the said distributed subsystems by subsystem-specific monitors. Events detected by monitors are sent to event handlers. Event handlers process events by filtering them through activities such as event correlation, removal of duplicates, and rollup. Filtered events are given by Event Managers to Recovery Drivers which determine the recovery program corresponding to the event, and executing the recovery program or set of recovery actions by coordination among the recovery managers. Given failures of said event handlers or recovery managers, the infrastructure performs the additional steps of: coordinating among remaining event handlers and recovery managers to handle completion or termination of ongoing recovery actions, discovering the current state of the system by resetting the said monitors, and handling any new failure events that may have occurred in the interim.

REFERENCES:
patent: Re34100 (1992-10-01), Hartness
patent: 4480304 (1984-10-01), Carr et al.
patent: 4627055 (1986-12-01), Mori et al.
patent: 4807224 (1989-02-01), Naron et al.
patent: 4945474 (1990-07-01), Elliot et al.
patent: 4984240 (1991-01-01), Keren-Zvi et al.
patent: 5084816 (1992-01-01), Boese et al.
patent: 5243601 (1993-09-01), Tague et al.
patent: 5258984 (1993-11-01), Menon et al.
patent: 5295258 (1994-03-01), Jewett et al.
patent: 5307354 (1994-04-01), Crammer et al.
patent: 5333308 (1994-07-01), Ananthanpillai
patent: 5333314 (1994-07-01), Masai et al.
patent: 5349662 (1994-09-01), Johnson et al.
patent: 5355484 (1994-10-01), Record et al.
patent: 5408649 (1995-04-01), Beshears et al.
patent: 5440726 (1995-08-01), Fuchs et al.
patent: 5440741 (1995-08-01), Morales et al.
patent: 5475839 (1995-12-01), Watson et al.
patent: 5528750 (1996-06-01), Lubart et al.
patent: 5592664 (1997-01-01), Starkey
patent: 5608908 (1997-03-01), Barghouti et al.
patent: 5621892 (1997-04-01), Cook
patent: 5625821 (1997-04-01), Record et al.
patent: 5630047 (1997-05-01), Wang

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for monitoring and recovery of subsystems in a distribute does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for monitoring and recovery of subsystems in a distribute, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for monitoring and recovery of subsystems in a distribute will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1292481

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.