Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
1997-11-03
2001-01-23
Beausoliel, Jr., Robert W. (Department: 2785)
C709S241000, C709S223000
active
06178529
ABSTRACT:
FIELD OF THE INVENTION
The invention relates generally to computer network servers, and more particularly to computer servers arranged in a server cluster.
BACKGROUND OF THE INVENTION
A server cluster is a group of at least two independent servers connected by a network and managed as a single system. The clustering of servers provides a number of benefits over independent servers. One important benefit is that cluster software, which is run on each of the servers in a cluster, automatically detects application failures or the failure of another server in the cluster. Upon detection of such failures, failed applications and the like can be quickly restarted on a surviving server, with no substantial reduction in service. Indeed, clients of a Windows NT cluster believe they are connecting with a physical system, but are actually connecting to a service which may be provided by one of several systems. To this end, clients create a TCP/IP session with a service in the cluster using a known IP address. This address appears to the cluster software as a resource in the same group (i.e., a collection of resources managed as a single unit) as the application providing the service. In the event of a failure the cluster service “moves” the entire group to another system.
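The group-level failover described above can be pictured with a small model. This is only a sketch under stated assumptions: the names `Group`, `Node`, and `fail_over`, and the policy of picking the first surviving server, are hypothetical illustrations and are not taken from the patent or from Windows NT cluster software.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Group:
    """A collection of resources managed (and moved) as a single unit."""
    name: str
    resources: List[str]


@dataclass
class Node:
    """One server in the cluster (hypothetical model)."""
    name: str
    alive: bool = True
    groups: List[Group] = field(default_factory=list)


def fail_over(failed: Node, nodes: List[Node]) -> Node:
    """Mark `failed` as down and move all of its groups to a survivor.

    Assumption: the first surviving node is chosen; a real cluster
    service would apply its own placement policy.
    """
    failed.alive = False
    survivors = [n for n in nodes if n.alive and n is not failed]
    if not survivors:
        raise RuntimeError("no surviving node to host the groups")
    target = survivors[0]
    # The whole group moves together, so the IP address and the
    # application that serves it always land on the same node.
    target.groups.extend(failed.groups)
    failed.groups = []
    return target
```

Because the IP address and the application sit in the same group, a client that reconnects to the known address reaches the service on whichever server now hosts that group.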
Other benefits include the ability for administrators to inspect the status of cluster resources, and accordingly balance workloads among different servers in the cluster to improve performance. Dynamic load balancing is also available. Such manageability also provides administrators with the ability to update one server in a cluster without taking important data and applications offline. As can be appreciated, server clusters are used in critical database management, file and intranet data sharing, messaging, general business applications and the like.
A cluster works with a large number of basic system components, known as resource objects, which provide some service to clients in a client/server environment or to other components within the system. Resource objects range from physical devices, such as disks, to purely software constructs, such as processes, databases, and IP addresses.
As can be appreciated, these resource objects are rather disparate in nature. Notwithstanding, the cluster software on each system needs to control and monitor the operation of the resource objects on that system, regardless of their type. For example, the Windows NT Cluster design provides failure detectors and recovery mechanisms for working with a system's resources. However, because of the widely disparate types of resource objects, the software for resource monitoring heretofore needed to be highly complex so that the cluster was able to deal with each type of resource object it was controlling.
SUMMARY OF THE INVENTION
Accordingly, the present invention provides an improved method and system to facilitate the control and monitoring of disparate resources. The method and system enable cluster software to control and monitor resources and handle failure recovery thereof in a simple and common manner, regardless of the type of resource. The method and system are simple to implement, reliable and extensible to future methods and improvements.
Briefly, the present invention provides a method and system in a server cluster for monitoring and controlling a resource object. A resource component such as a dynamic link library is connected to the resource object, such as a physical device or application, for management thereof. A resource monitor preferably connects the resource component to the cluster service. The resource component includes a plurality of methods that are common to the resource components. The methods are called by the resource monitor to control and monitor operation of the resource object through the resource component. In this manner, the cluster service is able to treat all resource objects in the same manner without regard to the type of object.
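One way to picture the common method set is as an abstract interface that every resource component implements, with the resource monitor calling only that interface. The sketch below is illustrative: the method names `online`, `offline`, and `is_alive`, and the classes `DiskResource`, `ProcessResource`, and `ResourceMonitor`, are hypothetical and are not claimed to match the methods named in the patent or any actual cluster API.

```python
from abc import ABC, abstractmethod
from typing import List


class ResourceComponent(ABC):
    """Common methods every resource component exposes (hypothetical names)."""

    def __init__(self, name: str) -> None:
        self.name = name

    @abstractmethod
    def online(self) -> None:
        """Make the underlying resource object available."""

    @abstractmethod
    def offline(self) -> None:
        """Make the underlying resource object unavailable."""

    @abstractmethod
    def is_alive(self) -> bool:
        """Report whether the resource object is currently working."""


class DiskResource(ResourceComponent):
    """Stands in for a physical device such as a disk."""

    def __init__(self, name: str) -> None:
        super().__init__(name)
        self.mounted = False

    def online(self) -> None:
        self.mounted = True

    def offline(self) -> None:
        self.mounted = False

    def is_alive(self) -> bool:
        return self.mounted


class ProcessResource(ResourceComponent):
    """Stands in for a purely software resource such as a process."""

    def __init__(self, name: str) -> None:
        super().__init__(name)
        self.running = False

    def online(self) -> None:
        self.running = True

    def offline(self) -> None:
        self.running = False

    def is_alive(self) -> bool:
        return self.running


class ResourceMonitor:
    """Controls and monitors components through the common methods only."""

    def __init__(self, components: List[ResourceComponent]) -> None:
        self.components = components

    def bring_online(self) -> None:
        for component in self.components:
            component.online()

    def poll(self) -> List[str]:
        """Return the names of components that report they are not alive."""
        return [c.name for c in self.components if not c.is_alive()]
```

The monitor never inspects the concrete type of a component, so supporting a new kind of resource in this sketch means writing one more subclass rather than changing the monitor.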
Other benefits and advantages will become apparent from the following detailed description when taken in conjunction with the drawings, in which:
Gamache Rod
Massa Michael T.
Short Robert T.
Vert John D.
Beausoliel, Jr. Robert W.
Michalik & Wylie PLLC
Microsoft Corporation
Revak Christopher