Data synchronization of multiple remote storage

Electrical computers and digital processing systems: memory – Storage accessing and control – Control technique

Reexamination Certificate

Details

C711S156000, C714S006130, C714S016000, C707S793000, C710S005000, C710S019000, C710S039000

active

06745303

ABSTRACT:

CROSS-REFERENCES TO RELATED APPLICATIONS
NOT APPLICABLE
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
NOT APPLICABLE
REFERENCE TO A “SEQUENCE LISTING,” A TABLE, OR A COMPUTER PROGRAM LISTING APPENDIX SUBMITTED ON A COMPACT DISK.
NOT APPLICABLE
BACKGROUND OF THE INVENTION
The present invention relates generally to data processing storage systems comprising a local storage facility and two or more remote storage facilities that mirror at least certain of the data retained by the local storage facility. More particularly, the invention relates to a method, and apparatus implementing that method, to synchronize the data at surviving storage facilities in the event of failure of one of them.
The use of data processing over the years by commercial, military, governmental and other endeavors has resulted in tremendous amounts of data being stored, much of it virtually priceless because of its importance. Businesses, for example, risk collapse should their data be lost. For this reason alone, the local data is backed up as one or more copies, which are retained for use should the original data be corrupted or lost. The more important the data, the more elaborate the methods of backup. For example, one approach to protecting sensitive or valuable data is to store backup copies of that data at one or more sites that are geographically remote from the local storage facility. Each remote storage facility maintains a mirror image of the data held by the local storage facility, and changes (e.g., writes, deletions, etc.) to the local data image of the local storage facility are transferred and also effected at each of the remote storage facilities so that the mirroring of the local data image is maintained. An example of a remote storage system for mirroring data at a local storage system is shown by U.S. Pat. No. 5,933,653.
Updates sent to the remote storage facilities are often queued and sent as a group to keep the overhead of remote copying operations at a minimum. Also, the transmission medium is often an Internet connection or similar. For these reasons, the data images mirroring the local data will, at times, not be the same as the local data. If more than one remote storage is used to mirror the local data, there often will be times when the data images of the remote storages will be different from one another, at least until updated by the local storage facility. These interludes of different data images can be a problem if the local facility fails, leaving only the remote storage facilities. Failure of the local storage facility can leave some remote storage facilities with data images that closely, if not exactly, mirror that of the local storage facility before failure, while others have older “stale” data images that were never completely updated by the last update operation. Thus, failure of the local storage facility may require the remote storage facilities to re-synchronize the data between them so that all have the same and latest data image before restarting the system. There are several approaches to data synchronization.
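The divergence described above can be illustrated with a minimal sketch. The class `RemoteLink`, its methods, and the update labels are hypothetical names introduced only for illustration; they do not come from the patent. The sketch assumes the local facility holds one queue of pending updates per remote facility and transfers each queue as a group.

```python
from collections import deque


class RemoteLink:
    """Hypothetical per-remote queue kept by the local facility (illustrative only)."""

    def __init__(self, name):
        self.name = name
        self.pending = deque()  # updates queued locally but not yet sent
        self.image = []         # updates already applied at the remote facility

    def queue(self, update):
        self.pending.append(update)

    def send_group(self):
        # Transfer every queued update as one group, emptying the queue.
        while self.pending:
            self.image.append(self.pending.popleft())


links = [RemoteLink("remote-A"), RemoteLink("remote-B")]
for update in ["w1", "w2", "w3"]:
    for link in links:
        link.queue(update)

# Only remote-A's group is sent before the (assumed) failure of the local facility.
links[0].send_group()

print(links[0].image)  # ['w1', 'w2', 'w3']
print(links[1].image)  # []
```

If the local facility failed at this point, remote-A would hold the current image and remote-B a stale one, which is the situation the synchronization approaches below must resolve.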
If removable media (e.g., tape, CD-R, DVD, etc.) is available at the local and remote storage facilities, it can be used for synchronization. For example, a system administrator copies data to tape from a selected remote storage facility (the image-donating facility) that is believed to have the most up-to-date data image of the local facility. Then, to keep the data image from changing before it is used to synchronize the other remote storage facilities, input/output (I/O) operations at the image-donating facility are halted until the tape can be circulated to update the other remote storage facilities. At each of those remote storage facilities, an administrator copies the data from the removable media to storage at that site. Then, the system administrator re-configures the entire system so that the selected remote storage facility becomes the new local storage facility, and its I/O operations are allowed to commence. This approach is efficient when the amount of data involved is small, but not for larger systems. Larger systems produce data that grows rapidly, so copying it for the entire synchronization process could take an inordinate amount of time.
Lacking removable media, another approach would be to use any network connections between the various storage facilities to communicate data. This approach requires that one storage facility be selected to replace the former local (but now failed) storage facility. I/O operations at the selected storage facility are halted, for the same reasons stated above, and a re-synchronizing copy process is initiated between the selected storage facility and the other remote storage facilities. When the re-synchronization process is complete, I/O operations are restarted at the selected storage facility, and the system proceeds as before, albeit with one fewer storage facility (the failed former local storage facility).
A major problem with this latter approach is the time needed for the re-synchronization process, particularly for larger amounts of data. For example, a storage of 100 terabytes (TB) of data, using a 100 MB/s network transfer connection, will take approximately 11.57 days to transfer all the data: $(100 \times 10^{12}) / (100 \times 10^{6}) = 10^{6}$ sec ≈ 277 hours ≈ 11.57 days. This is the time for re-synchronization of just one storage facility. If re-synchronization is to be performed for more than one storage facility, the problem is exacerbated. Also, during the re-synchronization process, I/O operations of the storage facilities involved are halted.
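The transfer-time estimate above can be reproduced with a short calculation. This is only a sketch: the function name `resync_seconds` is illustrative, and the calculation assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes) and a link that sustains its full rate with no protocol overhead.

```python
def resync_seconds(data_bytes, link_bytes_per_sec):
    """Time to copy a full data image over one link, ignoring all overhead."""
    return data_bytes / link_bytes_per_sec


# 100 TB copied over a 100 MB/s connection (decimal units assumed).
seconds = resync_seconds(100 * 10**12, 100 * 10**6)
hours = seconds / 3600
days = seconds / 86400

print(f"{seconds:.0f} s = {hours:.1f} h = {days:.2f} days")
# 1000000 s = 277.8 h = 11.57 days
```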
BRIEF SUMMARY OF THE INVENTION
The present invention provides a method, and architecture for implementing that method, of synchronizing two or more remote data storage facilities so that they hold and maintain the same data images in the event of a failure of the local storage.
Broadly, the invention pertains to a data processing system comprising a local data storage facility communicatively coupled to (i.e., in communication with) two or more remote storage facilities. Each of the storage facilities, whether local or remote, includes storage media for data storage. Data maintained on the storage media at the local data storage facility is mirrored on storage media at the remote storage facilities. Changes to the data image of the local storage facility are periodically sent to the remote storage facilities for updating their data images using a remote copy process that sends data messages with the data updates. Each of the storage facilities keeps information that is indicative of the history of what updates have been received by the remote storage facilities and what updates have been received and implemented (by writes to the storage media of such remote storage facility). In the event of failure of a storage facility, the surviving storage facilities circulate the historical update information to determine the differences, if any, among the data images, i.e., whether there are updates that some of the surviving storage facilities have not received. If so, the surviving storage facilities will synchronize their data images so that all have a substantially identical data image.
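One way to picture the comparison of historical update information is the following minimal sketch. It assumes, purely for illustration, that every update carries a monotonically increasing sequence number and that each facility records the highest number it has received and the highest it has written to its storage media; the text above does not specify this representation. The names `UpdateHistory`, `images_differ`, and `most_current` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class UpdateHistory:
    facility: str
    last_received: int  # highest update sequence number received (assumed scheme)
    last_written: int   # highest sequence number written to storage media


def images_differ(histories):
    """True if the surviving facilities have not all written the same updates."""
    return len({h.last_written for h in histories}) > 1


def most_current(histories):
    """The facility whose history shows the latest received update."""
    return max(histories, key=lambda h: h.last_received)


survivors = [
    UpdateHistory("remote-A", last_received=12, last_written=12),
    UpdateHistory("remote-B", last_received=10, last_written=9),
]

print(images_differ(survivors))          # True
print(most_current(survivors).facility)  # remote-A
```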
According to one embodiment of the invention, synchronization is achieved by a “roll-forward” operation in which the remote storage facility having the latest updates, as indicated by the historical update information, sends the needed updates to the other remote storage facilities to bring all data images up to date. In an alternative “roll-back” synchronization operation, updates are discarded to bring all data images back to the same level.
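The two synchronization operations can be contrasted in a short sketch. It is an illustration under stated assumptions, not the claimed implementation: each surviving facility is modeled as a hypothetical dictionary mapping update sequence numbers to update data, and updates are assumed to be applied in order without gaps, so the highest sequence number characterizes each data image.

```python
import copy


def latest(log):
    """Highest sequence number held by a facility (0 if it holds none)."""
    return max(log, default=0)


def roll_forward(facilities):
    """Copy missing updates from the most current facility to all the others."""
    donor = max(facilities.values(), key=latest)
    for log in facilities.values():
        if log is donor:
            continue
        for seq, data in donor.items():
            log.setdefault(seq, data)


def roll_back(facilities):
    """Discard updates newer than the latest update common to all facilities."""
    common = min(latest(log) for log in facilities.values())
    for log in facilities.values():
        for seq in [s for s in log if s > common]:
            del log[seq]


survivors = {
    "remote-A": {1: "a", 2: "b", 3: "c"},
    "remote-B": {1: "a", 2: "b"},
}

forward = copy.deepcopy(survivors)
roll_forward(forward)
back = copy.deepcopy(survivors)
roll_back(back)

print(forward["remote-B"])  # {1: 'a', 2: 'b', 3: 'c'}
print(back["remote-A"])     # {1: 'a', 2: 'b'}
```

Rolling forward preserves the newest updates at the cost of transferring them, while rolling back needs no transfer of update data but gives up the updates that not every survivor holds.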
Advantages of the invention include the fact that in data processing systems having storages that are mirrored, the mirrored images of the local storage will correspond to one another in the event of a failure of the local storage they mirror.
In another embodiment of the invention queue structures are maintained by each of the storage facilities, identifying, in a roll back
