Techniques for providing data within a data storage system

Error detection/correction and fault detection/recovery – Pulse or data error handling – Data formatting to improve error detection correction...

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Techniques for providing data within a data storage system Techniques for providing data within a data storage system

: 2001-02-14
: 2004-05-25
: Decady, Albert (Department: 2133)
: Error detection/correction and fault detection/recovery
: Pulse or data error handling
: Data formatting to improve error detection correction...

: C714S758000, C710S020000
: Reexamination Certificate
: active
: 06742146
: ABSTRACT:

BACKGROUND OF THE INVENTION
A typical data storage system stores and retrieves data for one or more external hosts. It is common for such a data storage system to include front-end circuitry, a cache, back-end circuitry, and a set of disk drives. In general, the cache operates as a buffer for data exchanged between the external hosts and the disk drives. The front-end circuitry operates as an interface for transferring data from the hosts to the cache, and vice versa. Similarly, the back-end circuitry operates as an interface for transferring data from the cache to the disk drives, and vice versa.
Some data storage systems are capable of storing and retrieving data having a count-key-data (CKD) record format (hereinafter referred to as CKD data). Such data consists of a count field containing the number of bytes of data, an optional key field by which particular records can be easily recognized, and the data itself. In general, CKD data does not have a standard size. That is, CKD data does not arrive in complete blocks, i.e., consistently aligned with a block or sector boundary. Rather, CKD data is arbitrary in size, varying from transmission to transmission.
Some data storage systems, which are equipped to handle CKD data, associate cyclic redundancy check (CRC) codes with the CKD data for fault tolerance purposes. In one conventional data storage system, when the front-end circuitry receives CKD data for storage, the front-end circuitry associates a CRC code with the CKD data and provides the CKD data and the associated CRC code to the cache. The back-end circuitry then reads the CKD data and the associated CRC code out of the cache, confirms that the CKD data is not corrupt or garbled based on the CRC code, and stores the CKD data and the CRC code on the set of disk drives.
It should be understood that data transfers between components of the above-described conventional data storage system (i.e., between the front-end circuitry and the cache, between the cache and the back-end circuitry, etc.) occur in block-sized or block-aligned operations. The front-end circuitry typically handles conversion of the non-standard-sized CKD data to data blocks. In particular, in response to CKD received from a host for storage, the front-end circuitry provides, to the cache, a block of data including (i) the CKD data, (ii) an associated CRC code appended to an end of the CKD data, and (iii) old, invalid data remaining in the front-end circuitry for alignment with a block boundary (e.g., a 512 byte boundary). It should be understood that the CRC code applies only to the CKD data, and not to the old, invalid data. Furthermore, in a separate signal (e.g., a message to the back-end circuitry), the front-end circuitry identifies the number of bytes of CKD data in the data block so that the back-end circuitry can use that number as an offset to find the CRC code.
As explained earlier, when the back-end circuitry retrieves the block of data from the cache, the back-end circuitry checks the CRC code to confirm that the CKD data is still intact, i.e., verifies that the CKD data is not corrupt or garbled in some manner. To this end, the back-end circuitry generates (i) a second CRC code based on the entire data block, (ii) a third CRC code based only on the old, invalid data, and (iii) an expected value for the initial CRC code for the CKD data (i.e., the CRC code appended to the CKD data within the data block) based on the second and third CRC codes (e.g., by performing an exclusive OR operation on the second and third CRC codes). The back-end circuitry then compares the expected value with the initial CRC code. If there is a match, the back-end circuitry concludes that the CKD data is without error and stores the CKD data (and perhaps the initial CRC code as well) in the disk drives. However, if the generated expected value does not match the initial CRC code, the back-end circuitry concludes that the CKD data includes an error (i.e., that one or more bits of the CKD data is incorrect), and initiates an error handling procedure (e.g., notifies the front-end circuitry that the CKD data includes an error and invites the front-end circuitry to retransmit the CKD data).
SUMMARY OF THE INVENTION
Unfortunately, there are deficiencies to the above-identified conventional data storage system which stores CKD data by including old, invalid data with the CKD data for block alignment purposes. For example, for the back-end circuitry of the above-described conventional data storage system to confirm that the CKD data from the front-end circuitry is not corrupt, the back-end circuitry performs a complex series of operations. In particular, the back-end circuitry generates (i) a second CRC code based on the entire data block containing the CKD data, (ii) a third CRC code based only on the old, invalid data in the data block, and (iii) an expected result based on the second and third CRC codes. The back-end circuitry then compares the expected value with the initial CRC code (i.e., the CRC code appended to the CKD data within the data block) to determine whether the CKD data is corrupt. This complex series of operations, which is typically implemented in software, requires a significant amount of time to complete. As a result, the transfer of CKD data through the back-end circuitry tends to be relatively slow from a performance standpoint compared to transfer times of other types of data due to the large amount of error checking overhead performed by the back-end circuitry.
In contrast to the above-described conventional data storage system, the invention is directed to data storage techniques that include an error detection code and cleared bytes (e.g., zeroes) with certain types of data (e.g., CKD data). The use of cleared bytes with CKD data alleviates the need to perform a complex series of software operations at the back-end to detect corrupted CKD data. Rather, when the CKD data is followed by an appended CRC code and cleared bytes to form an aligned block of data, error checking of the CKD data (and the entire data block) can simply involve generating a CRC code based on the entire data block and comparing that generated CRC code with the initial CRC code appended to the CKD data within that data block. Accordingly, the error detection process is relatively simpler and takes less time than the above-described conventional approach.
One arrangement of the invention is directed to a data storage system that includes a circuit (e.g., a front-end interface) having a memory pipeline that (i) receives a stream of data elements (e.g., CKD data), and (ii) provides a series of byte groups that includes the stream of data elements, an error detection code (e.g., a CRC code) and a set of cleared bytes (e.g., zeroes) to a set of storage devices. The circuit further includes a controller, coupled to the memory pipeline, that provides the error detection code and the set of cleared bytes to the memory pipeline such that each of the series of byte groups provided by the memory pipeline has a same byte width (e.g., eight bytes). The inclusion of the error detection code and the set of cleared bytes enables consistent alignment of each byte group in the series. Furthermore, if the series of byte groups is loaded into an initialized memory sector (e.g., a cleared cache of the data storage system), a CRC code can be (i) generated based on the entire sector and (ii) compared to the CRC code within the series of byte groups to determine whether the stream of data elements is without error.
In one arrangement, the memory pipeline includes an output stage that connects to an external memory, and the controller is configured to direct the memory pipeline to further provide a set of subsequent byte groups exclusively having cleared bytes. In this arrangement, the output stage provides both the series of byte groups and the set of subsequent byte groups to the external memory to exactly fill a sector (e.g., 512 bytes) of an external memory (e.g., cache memory, dual-ported random access memory leading to the cache memory, a disk d

Affiliated with

Gross William K.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Scaringella Stephen L.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Tung Victor W.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Britt Cynthia

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Chapin & Huang , L.L.C.

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

De'cady Albert

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

EMC Corporation

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Huang, Esq. David E.

Attorney

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Techniques for providing data within a data storage system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Techniques for providing data within a data storage system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Techniques for providing data within a data storage system will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3216041

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure