Updating data and parity data with and without read caches

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06446220

ABSTRACT:

CROSS-REFERENCE TO RELATED APPLICATIONS
This application is related to the following co-pending and commonly-assigned patent applications, all of which are filed on the same date herewith, and all of which are incorporated herein by reference in their entirety:
“Distributed Storage System Using Front-End And Back-End Locking,” by Jai Menon, Divyesh Jadav, Kal Voruganti, Ser. No. 09/124,004;
“System for Updating Data in a Multi-Adaptor Environment,” by Jai Menon, Divyesh Jadav, Deepak Kenchammana-Hosekote, Ser. No. 09/128,574;
“System For Changing The Parity Structure Of A Raid Array,” by Jai Menon, Divyesh Jadav, Deepak Kenchammana-Hosekote, Ser. No. 09/129,012;
“Updating And Reading Data And Parity Blocks In A Shared Disk System,” by Jai Menon, Ser. No. 09/129,067; and
“Updating and Reading Data and Parity Blocks in a Shared Disk System with Request Forwarding,” by Jai Menon and Divyesh Jadav, Ser. No. 09/128,754.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a system for updating data and parity data in a shared disk system.
2. Description of the Related Art
In Redundant Arrays of Independent Disks (RAID) systems, data files and related parity are striped across multiple disk drives. In storage subsystems which manage numerous hard disk drives as a single direct access storage device (DASD), the RAID logic is implemented in the controller of the subsystem. RAID storage methodologies have also been implemented in software for execution on a single host computer. This allows the single host computer, such as a personal computer, to implement RAID storage techniques on local hard disk drive space. Such software RAID methodologies are described in “Algorithms for Software and Low Cost Hardware RAIDs,” by Jai Menon, Jeff Reigel, and Jim Wyllie, Document No. 1063-6390/95, pgs. 411-418 (IEEE 1995), which is incorporated herein by reference in its entirety.
One problem with the single storage subsystem is the risk of failure. Techniques have been developed to improve failback and recovery in case of failures in the hardware controller. One such failback technique is the Fast Write Technique which provides two separate controllers on different power boundaries that control the flow of data from host systems to DASDs. If one controller fails, the other controller can continue writing data to the DASD. Typically a non-volatile storage unit (NVS) is included with each separate controller, such that each NVS connected to a controller backs up the data the other controller is writing to DASD. Such failback systems employing the two-controller failsafe structure are described in U.S. Pat. Nos. 5,636,359, 5,437,022, 5,640,530, and 4,916,605, all of which are assigned to International Business Machines, Corporation (IBM), the assignee of the subject application, and all of which are incorporated herein by reference in their entirety.
RAID systems can also be implemented in a parallel computing architecture in which there is no central controller. Instead, a plurality of independent controllers that control local hard disk storage devices are separate nodes that flnction together in parallel to implement RAID storage methodologies across the combined storage space managed by each node. The nodes are connected via a network. Parity calculations can be made at each node, and not centrally. Such parallel RAID architecture is described in “The TickerTAIP Parallel RAID Architecture,” by Pei Cao, Swee Boon Lim, Shivakumar Venkatarman, and John Wilkes, published in ACM Transactions on Computer Systems, Vol. 12, No. 3, pgs. 236-269 (August, 1994), which is incorporated herein by reference in its entirety.
One challenge in shared disk systems implementing a parallel, shared disk RAID architecture is to provide a system for insuring that data and parity data are properly updated to disks in the system. Another challenge is to accomplish this goal of insuring data consistency and at the same time reduce the time to recover failed disks, reduce recovery time if both a disk and adaptor fail, and reduce network message traffic when handling data and parity updates.
SUMMARY OF THE INVENTION
To provide an improved system for handling updates to data and parity in a shared disk system, preferred embodiments of the present invention disclose a system for updating data. A first processing unit receives a data update to a data block in a first storage device. Parity data for the data block is maintained in a second storage device. A parity group is comprised of the data block and the parity data. After determining that the first processing unit does not control access to the parity group including the data block to update, the first processing unit sends a message to a second processing unit controlling access to the parity group requesting control of access to the parity group The first processing unit determines new parity data from the data update, the data at the data block in the first storage device, and the parity data in the second storage device. The first processing unit then writes the data update to the data block in the first storage device and the new parity data to the second storage device.
In further embodiments, a parity group set indicates a plurality of parity groups. The first processing unit determines a parity group set including the parity group including the data block to update after receiving the data update. The first processing unit then determines whether a first data structure indicates that another data block in the parity group set is being updated. If so, the first processing unit sends a parity group set message to the second processing unit including information on the parity group set including the data block to be updated and a third data structure indicating parity groups recently updated.
Preferred embodiments provide systems and methods for updating data and parity groups and at the same time minimizing network message traffic between the processing units, e.g., adaptors, in the system. Further embodiments use messaging to keep the second adaptor informed of the parity groups being updated. In this way, if the first adaptor fails, the second adaptor can readily determine the inconsistent parity groups that need to be updated or block access to the inconsistent parity groups before recovering failed data. Preferred embodiments seek to balance the goals of reducing message traffic and improving failure recovery time.


REFERENCES:
patent: 4733352 (1988-03-01), Nakamura et al.
patent: 4916605 (1990-04-01), Beardsley et al.
patent: 5140592 (1992-08-01), Idleman
patent: 5208813 (1993-05-01), Stallmo
patent: 5293618 (1994-03-01), Tandai et al.
patent: 5301297 (1994-04-01), Menon et al.
patent: 5373512 (1994-12-01), Brady
patent: 5375128 (1994-12-01), Menon et al.
patent: 5437022 (1995-07-01), Beardsley et al.
patent: 5490248 (1996-02-01), Dan et al.
patent: 5499337 (1996-03-01), Gordon
patent: 5526482 (1996-06-01), Stallmo et al.
patent: 5530830 (1996-06-01), Iwasaki et al.
patent: 5530948 (1996-06-01), Islam
patent: 5546535 (1996-08-01), Stallmo
patent: 5572660 (1996-11-01), Jones
patent: 5574863 (1996-11-01), Nelson
patent: 5574882 (1996-11-01), Menon et al.
patent: 5636359 (1997-06-01), Beardsley et al.
patent: 5640530 (1997-06-01), Beardsley et al.
patent: 5664187 (1997-09-01), Burkes et al.
patent: 5787460 (1998-07-01), Yashiro et al.
patent: 5813016 (1998-09-01), Sumimoto
patent: 5848229 (1998-12-01), Morita
patent: 5913227 (1999-06-01), Raz et al.
patent: 5940856 (1999-08-01), Arimillii et al.
patent: 5940864 (1999-08-01), Arimilli et al.
patent: 5999930 (1999-12-01), Wolff
patent: 6073218 (2000-06-01), DeKoning
Chen, P. et al., “RAID: High-Performance, Reliable Secondary Storage”,ACM Computing Surveys, vol. 26, No. 2, pp. 145-185 (Jun. 1994).
IBM Brochure, “SSA RAID Adapter for PC Servers”, pp. 1-2, ©International Business Machines Corporation 1996.
IBM Manual, “IBM PC ServeRAID Adapter—84H7117 Installation Instructions and User's Guide”, 77 pages, First Edition (Jan. 1997).
IBM Brochure, “3527 SSA Storage Subsystem for PC

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Updating data and parity data with and without read caches does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Updating data and parity data with and without read caches, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Updating data and parity data with and without read caches will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2854524

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.