Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Patent
1998-04-06
1999-09-28
Hua, Ly V.
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
714 10, G06F 1100, G06F 1116
Patent
active
059580704
ABSTRACT:
A mechanism for maintaining a consistent, periodically updated state in main memory without constraining normal computer operation is provided, thereby enabling a computer system to recover from faults without loss of data or processing continuity. In this invention, a first computer includes a processor and input/output elements connected to a main memory subsystem including a primary element. A second computer has a remote checkpoint memory element, which may include one or more buffer memories and a shadow memory, which is connected to the main memory subsystem of the first computer. During normal processing, an image of data written to the primary memory element is captured by the remote checkpoint memory element. When a new checkpoint is desired (thereby establishing a consistent state in main memory to which all executing applications can safely return following a fault), the data previously captured is used to establish a new checkpointed state in the second computer. In case of failure of the first computer, the second computer can be restarted to operate from the last checkpoint established for the first computer. This structure and protocol can guarantee a consistent state in main memory, thus enabling fault-tolerant operation.
REFERENCES:
patent: 4740969 (1988-04-01), Fremont
patent: 5235700 (1993-08-01), Alaiwan et al.
patent: 5408649 (1995-04-01), Beshears et al.
patent: 5504861 (1996-04-01), Crockett et al.
patent: 5600796 (1997-02-01), Okamura et al.
patent: 5737514 (1998-04-01), Stiffler
patent: 5745672 (1998-04-01), Stiffler
patent: 5751939 (1998-05-01), Stiffler
patent: 5787243 (1998-07-01), Stiffler
Hua Ly V.
Texas Micro, Inc.
LandOfFree
Remote checkpoint memory system and protocol for fault-tolerant does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Remote checkpoint memory system and protocol for fault-tolerant , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Remote checkpoint memory system and protocol for fault-tolerant will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-697980