Reexamination Certificate
1998-03-10
2001-11-13
Le, Dieu-Minh T. (Department: 2181)
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
C714S005110
active
06317844
BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates to storage systems.
2. Related Art
Computer storage systems are used to record and retrieve data. In some computer systems, storage systems communicate with a set of client devices, and provide services for recording and retrieving data to those client devices. Because data storage is important to many applications, it is desirable for the services and data provided by the storage system to be available for service to the greatest degree possible. It is therefore desirable to provide storage systems that can remain available for service even in the face of component failures in the storage system.
One known technique for providing storage systems that can remain available for service is to provide a plurality of redundant storage elements, with the property that when a first storage element fails, a second storage element is available to provide the services and the data otherwise provided by the first. Transfer of the function of providing services from the first to the second storage element is called “failover.” The second storage element maintains a copy of the data maintained by the first, so that failover can proceed without substantial interruption.
A first known technique for achieving failover is to cause the second storage element to copy all the operations of the first. Thus, each storage operation completed by the first storage element is also completed by the second. This first known technique is subject to drawbacks: (1) It consumes substantial processing power at the second storage element in duplicating the efforts of the first, and most of that duplicated work is wasted. (2) It slows the first storage element in confirming completion of operations, because the first storage element waits for the second to also complete the same operations.
A second known technique for achieving failover is to identify a sequence of checkpoints at which the first storage element is at a consistent and known state. On failover, the second storage element can continue operation from the most recent checkpoint. For example, the NFS (Network File System) protocol requires all write operations to be stored to disk before they are confirmed, so that confirmation of a write operation indicates a stable file system configuration. This second known technique is subject to drawbacks: (1) It slows the first storage element in performing write operations, because the first storage element waits for write operations to be completely stored to disk. (2) It slows recovery on failover, because the second storage element addresses any inconsistencies left by failure of the first between identified checkpoints.
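The cost named in drawback (1) can be illustrated with a minimal sketch of a write that is confirmed only after it has been forced to stable storage. This is not an NFS implementation; the function name `confirmed_write`, the file name, and the reply value are assumptions made for illustration only.

```python
import os
import tempfile


def confirmed_write(path, data: bytes) -> str:
    """Append data and return a confirmation only after it is on stable storage."""
    with open(path, "ab") as f:
        f.write(data)
        f.flush()              # push Python's buffer to the operating system
        os.fsync(f.fileno())   # ask the OS to push its buffers to the device
    return "ok"                # the confirmation is produced only after this point


with tempfile.TemporaryDirectory() as d:
    target = os.path.join(d, "export.dat")   # hypothetical file name
    reply = confirmed_write(target, b"block-1")
    with open(target, "rb") as f:
        stored = f.read()
```

Every confirmation in this sketch waits on the synchronous flush, which is the delay the paragraph above attributes to the second known technique.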
Accordingly, it would be advantageous to provide a storage system, and a method for operating a storage system, that efficiently uses all storage system elements, quickly completes and confirms operations, and quickly recovers from failure of any storage element. This advantage is achieved in an embodiment of the invention in which the storage system implements frequent and rapid checkpoints, and in which the storage system rapidly distributes duplicate commands for those operations between checkpoints among its storage elements.
SUMMARY OF THE INVENTION
The invention provides a storage system that is highly available even in the face of component failures in the storage system, and a method for operating that storage system. A first and a second file server each includes a file server request log for storing incoming file server requests. Both the first and second file servers have access to a common set of mass storage elements. Each incoming file server request is copied to both the first and second file servers; the first file server processes the file server request while the second file server maintains a copy in its file server request log. Each file server operates using a file system that maintains consistent state after each file server request. On failover, the second file server can perform those file server requests in its file server request log since the most recent consistent state.
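The request-log arrangement described above can be sketched as follows. This is a simplified model under stated assumptions, not the patented implementation: the class names `SharedDisks` and `FileServer`, the in-memory `cache`, the `checkpoint` and `take_over` methods, and the representation of a request as a `(path, data)` pair are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class SharedDisks:
    """Mass storage reachable by both servers; holds the last consistent state."""
    files: dict = field(default_factory=dict)


@dataclass
class FileServer:
    name: str
    disks: SharedDisks
    request_log: list = field(default_factory=list)  # requests since last consistent state
    cache: dict = field(default_factory=dict)        # in-memory view, not yet on disk

    def log(self, request):
        self.request_log.append(request)

    def apply(self, request):
        path, data = request
        self.cache[path] = data

    def checkpoint(self, peer):
        # Flush the in-memory view to the shared disks; both logs can then be emptied.
        self.disks.files.update(self.cache)
        self.request_log.clear()
        peer.request_log.clear()

    def take_over(self):
        # Start from the last consistent state on disk and replay the logged requests.
        self.cache = dict(self.disks.files)
        for request in self.request_log:
            self.apply(request)


def submit(request, primary, secondary):
    """Copy the request to both servers; only the primary performs it."""
    primary.log(request)
    secondary.log(request)
    primary.apply(request)


disks = SharedDisks()
a = FileServer("A", disks)
b = FileServer("B", disks)

submit(("/x", "1"), a, b)
a.checkpoint(b)              # "/x" is now part of the consistent on-disk state
submit(("/y", "2"), a, b)    # logged on both servers, applied only in A's memory

b.take_over()                # A is assumed to have failed; B replays "/y" from its own log
print(b.cache)
```

In this sketch the second server does no work per request beyond appending to its log, and on failover it replays only the requests received since the last consistent state.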
In a second aspect of the invention, a file server system provides mirroring of one or more mass storage elements. Each incoming file server request is copied to both the first file server and the second file server. The first file server performs the file server requests to modify a primary set of mass storage elements, and also performs the same file server requests to modify a mirror set of mass storage elements. The mirror mass storage elements are disposed physically separately from the primary mass storage elements, such as at another site, and provide a resource in the event the entire primary set of mass storage elements is to be recovered.
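The mirroring aspect can likewise be sketched in a few lines, assuming each set of mass storage elements is modeled as a simple mapping. The names `VolumeSet`, `perform`, and `rebuild_primary`, and the site labels, are illustrative assumptions and do not come from the patent.

```python
class VolumeSet:
    """A set of mass storage elements located at one site."""

    def __init__(self, site):
        self.site = site
        self.blocks = {}

    def write(self, path, data):
        self.blocks[path] = data


def perform(request, primary_set, mirror_set):
    """Perform one file server request against both the primary and the mirror set."""
    path, data = request
    primary_set.write(path, data)
    mirror_set.write(path, data)


def rebuild_primary(mirror_set, site):
    """Recover an entire primary set from the contents of the mirror set."""
    fresh = VolumeSet(site)
    fresh.blocks = dict(mirror_set.blocks)
    return fresh


primary = VolumeSet("site-1")
mirror = VolumeSet("site-2")   # disposed physically separately from the primary

for request in [("/a", "alpha"), ("/b", "beta")]:
    perform(request, primary, mirror)

# The whole primary set is assumed lost; a replacement is rebuilt from the mirror.
recovered = rebuild_primary(mirror, "site-1")
```

Because the same requests are performed against both sets, the mirror holds the data needed to recover the primary set in full.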
Network Appliance Inc.
Swernofsky Law Group