Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
1998-08-28
2001-10-23
Beausoleil, Robert (Department: 2184)
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
C714S003000
Reexamination Certificate
active
06308284
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention generally relates to the storage of data for use in data processing systems. More particularly, this invention relates to maintaining data integrity and consistency in redundant storage systems.
2. Description of Related Art
Nearly all data processing system users are concerned with maintaining back-up data in order to insure continued data processing operations should their data become lost, damaged or otherwise unusable. Such back-up operations can be achieved through a variety of procedures. In one approach, copies of data on a primary storage device are made on the same or other media such as magnetic tape to provide an historical backup. Typically, however, these systems require all other operations in the data processing system to terminate while the backup is underway.
More recently disk redundancy has evolved as an alternative or complement to historical tape backups. Generally a redundant system uses two or more disk storage devices to store data in a form that enables the data to be recovered if one disk storage device becomes disabled. For example, a first disk storage device stores the data and a second disk storage device mirrors that data. Whenever a transfer is made to the first disk storage device, the data also transfers to the second disk storage device. Typically separate controllers and paths interconnect the two disk storage devices to the remainder of the computer system. One advantage of this type of system is that the redundant copy is made without interrupting normal operations.
Several systems have been proposed for providing concurrent backups to provide the advantage of a tape backup without interrupting normal operations. For example, U.S. Pat. No. 5,212,784 to Sparks discloses an automated concurrent data backup system in which a central processing unit (CPU) transfers data to and from storage devices through a primary controller. The primary controller connects through first and second independent buses to first and second mirrored storage devices respectively (i.e., a primary, or mirrored, storage device and a secondary, or mirroring, storage device). A backup controller and device connect to one or more secondary storage devices through its bus. Normally the primary controller writes data to the primary and secondary data storage devices. The CPU initiates a backup through the primary controller. In response the backup controller takes control of the second bus and transfers data from one secondary data storage device to the backup media. Applications continue to update the primary and any additional secondary storage devices. After a backup operation is completed, the primary controller resynchronizes the storage devices by updating the secondary storage device that acted as a source for the backup with any changes that occurred to the primary data storage device while the backup operation was underway.
U.S. Pat. Nos. 5,241,668 and 5,241,670 to Eastridge et al. disclose different aspects of concurrent backup procedures. In accordance with these references a request for a backup copy designates a portion of the stored data called a “dataset”. For example, if the data storage devices contain a plurality of discrete data bases, a dataset could include files associated with one such data base. In a normal operation, the application is suspended to allow the generation of an address concordance for the designated datasets. Execution of the application then resumes. A resource manager manages all input and output functions between the storage sub-systems and associated memory and temporary memory. The backup copy forms on a scheduled and opportunistic basis by copying the designated datasets from the storage sub-systems and updating the address concordance in response to the copying. Application updates are processed during formation of the backup copy by buffering the updates, copying the effected uncopied designated datasets to a storage sub-system memory, updating the address concordance in response to the copying, and processing the updates. The designated datasets can also be copied to the temporary storage memory if the number of designated datasets exceeds some threshold. The designated datasets are also copied to an alternate memory from the storage sub-system, storage sub-system memory and temporary host memory utilizing the resource manager and the altered address concordance to create a specified order backup copy of the designated datasets from the copied portions of the designated datasets without user intervention.
Still referring to the Eastridge et al. patents, if an abnormal event occurs requiring termination of the backup, a status indication is entered into activity tables associated with the plurality of storage sub-systems and devices in response to the initiation of the backup session. If an external condition exists that requires the backup to be interrupted, the backup copy session terminates and indications within the activity tables are reviewed to determine the status of the backup if a reset notification is raised by a storage sub-system. This enables the determination of track extents which are active for a volume associated with a particular session. A comparison is then made between the track events which are active and volume and track extents information associated with a physical session identification. If a match exists between the track extents which are active and the volume of and track extent information associated with a physical session identification, the backup session resumes. If the match does not exist, the backup terminates.
U.S. Pat. No. 5,473,776 to Nosaki et al. discloses a concurrent backup operation in a computer system having a central processing unit and a multiple memory constituted by a plurality of memory devices for on-line storage of data processed by tasks of the central processing unit. A data backup memory is provided for saving data of the multiple memory. The central processing unit performs parallel processing of user tasks and a maintenance task. The user tasks include those that write currently processed data into the multiple memory. The maintenance task stops any updating of memory devices as a part of the multiple memory and saves the data to a data backup memory.
More recently the concept of redundancy has come to include geographically remote data facilities. As described in U.S. Pat. No. 5,544,347 to Yanai et al. for Remote Data Mirroring and U.S. Pat. No. 5,742,792 to Yanai et al. for Remote Data Mirroring (both assigned to the assignee of this invention), a computer system includes one or more local and one or more remote data facilities. Each local and remote data facility typically includes a data processing system with disk storage. A communications path, that may comprise one or more individual communications links, interconnects a local storages facility with a remote storage facility that is a mirror for the local storage facility. The physical separation can be measured in any range between meters and hundreds or even thousands of kilometers. In whatever form, the remote data facility provides data integrity with respect to any system errors produced by power failures, equipment failures and the like.
In prior art systems one dataset normally is stored in a single storage facility, so data consistency has been achieved whenever the remote storage facility exactly mirrors the local storage facility; i.e, the two facilities are in synchronism. Generally if a communications path comprising one or more communications links, fails (i.e., no data can be transferred over any of the communications links), the dataset remains in the remote storage facility, but no longer is updated. This becomes particularly important when data must be recovered because without consistency or synchronism data in a dataset that has not yet reached the remote or backup facility may be lost.
U.S. Pat. No. 5,720,029 to Kern et al. discloses one approach for providing a disaster recover system that utilizes a synchronous remote da
LeCrone Douglas E.
Murphy Daniel A.
Ofek Yuval
Beausoleil Robert
Bonzo Bryce P.
EMC Corporation
Herbster George A.
LandOfFree
Method and apparatus for maintaining data coherency does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for maintaining data coherency, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for maintaining data coherency will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2605421