Distributed, scalable data storage facility with cache memory

Electrical computers and digital processing systems: memory – Storage accessing and control – Shared memory area

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C711S118000, C711S141000, C711S142000, C711S143000, C711S144000, C711S145000, C711S146000, C711S148000, C709S212000, C709S213000, C709S214000, C709S215000, C709S216000

Reexamination Certificate

active

06757790

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention generally relates to data processing systems and more specifically to data storage facilities for use in such data processing systems.
2. Description of Related Art
Early data processing systems comprised a single processor, random access memory and a data storage facility in the form of a single magnetic disk drive. Such systems are still in wide use by small businesses and individuals and as terminals or nodes in a network. The capacities of the single magnetic disk drive associated with such systems are now into the hundred-gigabyte (i.e., 100*10
9
bytes) range. However, there are many applications in which even these increased capacities no longer are sufficient.
Increased storage capacities required by multi-processing systems with multiple access and increased database sizes have been realized by the development of data storage facilities with disk array storage devices. Concurrently with this development, a need has also arisen to attain redundancy in the data for data integrity purposes. Consequently there now are many applications that require disk storage facilities having terabyte (i.e., 10
12
bytes) and even multiple terabyte storage capacities.
Disk array storage devices have become available from the assignee of this invention and others with such capacities. These systems include a connection to a host system that may include one or more processors and random access memory. Data transfer requests, which include data read and data write requests, are received in an interface or host adapter in the data storage facility and processed into commands that the data storage facility recognizes. These systems use cache memory to enhance operations. A cache memory serves as an intermediate data repository between the physical disk drives and the host systems. Cache memories can reduce the time a data storage facility requires to complete a data read or write operation by returning requested data or by receiving data being sent to the data storage facility.
Such data storage facilities are generally characterized by having a single bus structure that interconnects the physical disk drives, the cache memory and the host adapter. All data commands and all data transfers must pass over this single path. As pressure for increasing data storage capacity and transfer rates continues to increase, the single data path can become a bottleneck. To overcome this bottleneck, some data processing systems now incorporate multiple independent disk array storage devices connected to a single host system. Others incorporate multiple disk array storage devices with multiple host systems.
As these data storage facilities have evolved, so have a number of important characteristics or functional specifications, particularly data redundancy and data coherency. Data redundancy addresses two potential problems. Redundancy at a site overcomes a problem of equipment failure. For example, if data redundancy at a site is achieved by mirroring, two or more separate physical disk drives replicate data. If one of those disk drives fails, the data is available at another physical disk drive. Replicating a disk array storage device at a geographically remote site and storing a copy of the data at each site can also achieve data redundancy. This type of data redundancy overcomes the problem of data loss due to destruction of the equipment at one site because the data at the other site is generally preserved.
Data coherency assures the data at different locations within one or more disk storage facilities is synchronized temporally. That is, if data in a set is stored across two or more separate data storage facilities, at any given instant any one data storage facility should be coherent with the data in the other storage facility. Data could become non-coherent, for example, if a pathway from a host to one of the data storage facilities were to be interrupted without promptly terminating transfers to another related data storage facility.
Generally, a customer initially purchases a disk array storage device with a base data storage facility supplied with a number of magnetic disk drives that provide an initial storage capacity. Often times it is the case that this number of drives is less than a maximum number that the device can support. An incremental increase in the total storage capacity can be achieved merely by adding one or more magnetic disk drives to the existing disk array storage device, generally at an incremental cost. However, when it becomes necessary to expand the capacity beyond the maximum capacity of the disk array storage device, it may become necessary to purchase a new base disk array storage device. The cost of this new base disk array storage device, even with a minimal storage capacity, will be greater than the incremental costs incurred by merely adding magnetic disk drives to the existing disk array storage device. The customer may also incur further programming and reconfiguration costs to integrate the new disk array storage device with the existing disk array storage device.
In many applications, additional capacity is concomitant with a need for greater throughput. However, all the read and write operations for such a disk array storage device continue to involve a single cache memory. Although the cache memory might be expanded, its throughput, measured in the possible number of accesses per unit time, does not increase. In these situations, the capacity increases, but at a reduction in performance as greater rates of read and write operations are encountered. As a result, the ability to scale such disk array storage devices becomes difficult. When such performance problems are anticipated, the usual approach is to add an entirely separate disk array storage device to the data processing system and then to deal with the coordination and coherency issues that may arise.
What is needed is a data storage facility that achieves all the foregoing specifications. That is, what is needed is a data storage facility that provides full redundancy with no single point of failure in the system. Such a data storage facility should be scalable both in terms of the number of host systems that can connect to and the total capacity of the data storage facility. The data storage facility should provide a fully redundant distributed cache memory to provide load balancing and fault tolerance for handling data in the cache memory. Such a facility should be constructed from readily available components with common features for manufacturing and cost efficiency and for limiting the need for spare components necessary to insure reliability. Still further the facility should operate with throughput that is relatively independent of actual storage capacity and the number of host systems connected to that data storage facility.
SUMMARY
Therefore it is an object of this invention to provide a high-performance, distributed cache data storage facility that is scalable to large data storage capacities.
Another object of this invention is to provide a distributed cache, scalable data storage facility that is fully redundant.
Still another object of this invention is to provide a distributed cache, scalable data storage facility that can be scaled both with respect to the number of host systems it serves and the capacity of the storage facility.
Still another object of this invention is to provide a distributed cache, scalable data storage facility that is constructed of readily available components having a common design and for manufacturing and cost efficiency and for reliability.
In accordance with this invention a data storage facility operates with a plurality of data processors, each of which can issue a host request for performing a data transfer with the data storage facility. The data storage facility comprises a plurality of persistent data storage locations at unique addresses in a common address space and control logic for transferring data to and from the addressed locations. A plurality of processor-controlled data hand

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Distributed, scalable data storage facility with cache memory does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Distributed, scalable data storage facility with cache memory, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Distributed, scalable data storage facility with cache memory will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3346796

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.