Fault-tolerant computer system with voter delay buffer

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C714S045000

Reexamination Certificate

active

06820213

ABSTRACT:

CROSS-REFERENCE TO RELATED APPLICATIONS
This Application is related to the following commonly-assigned, co-pending application entitled METHOD AND SYSTEM FOR UPGRADING FAULT-TOLERANT SYSTEMS, identified by Cesari and McKenna File No. 104160-0010.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention is related to fault-tolerant computer systems and, in particular, to a method for efficiently providing reliable operation in a computer system.
2. Background Information
In most data processing applications, reliable performance of a computer system is critical. To provide for a specified level of reliability, the computer system may include at least one redundant, or backup, central processing unit (CPU), where the CPUs perform the same operations and provide the same data output stream. The input/output (I/O) buses of the CPUs are continually monitored and compared to identify any differences in their respective data streams. If signal differences are detected, a voting device applies predetermined criteria to identify one of the CPUs as malfunctioning. In a redundant computer system having two CPUs, for example, the voting device may identify the CPU having a history of greater cumulative error correction as the malfunctioning CPU. However, experience has shown that this method has an unacceptably low accuracy rate.
The accuracy rate improves with the addition of a second redundant CPU to the computer system. All three CPU outputs are monitored and, when differences are detected, the CPU determined to be malfunctioning is the CPU producing an output not in agreement with the other two CPUs. This approach, however, incurs the additional expense and complexity of integrating the third CPU into the computer system.
Another method used in computer systems having only two redundant CPUs is to have each CPU revert to an idle state and/or lose output data while diagnostic procedures are initiated to determine which CPU is malfunctioning. Based on the results of the diagnostic procedure, one CPU may be identified as malfunctioning. One undesirable side effect of this approach is that the operation of the computer system is impacted and may be severely disrupted while the CPUs are in the idle state.
It is therefore an object of the present invention to provide a computer system achieving a high degree of reliability with a redundant CPU.
It is a further object of the present invention to provide such a computer system in which a malfunctioning CPU can be identified without first placing the CPU into an idle state.
It is a still further object of the present invention to provide such a computer system in which computational data is not lost while the malfunctioning CPU is identified.
It is yet another object of the present invention to provide such a system in which a malfunctioning CPU can be identified with a high degree of reliability. Other objects of the invention will be obvious, in part, and, in part, will become apparent when reading the detailed description to follow.
SUMMARY OF THE INVENTION
The present invention comprises a fault-tolerant computer system which includes a pair of CPUs that produce essentially identical data output streams, a voter delay buffer having first and second FIFO buffers, and an I/O module interconnecting the CPUs and the FIFO buffers. The I/O module compares the data output streams from the two CPUs for differences. If both CPU output streams remain identical, the data output of a selected CPU is transmitted to one or more peripheral devices. Otherwise, if the comparator indicates differences, the data output stream from one CPU is rerouted to the first FIFO, and the data output streams from the other CPU is rerouted to the second FIFO. Meanwhile, the CPUs continue processing operations and ongoing diagnostic procedures to identify one of the CPUs as malfunctioning. The FIFOs provide buffering for the data output streams which would otherwise be discarded. Additionally, use of the FIFOs allows the CPUs to continue operation and avoid a disruption to the computer system. If neither CPU is diagnosed as malfunctioning, the I/O module uses data from a priority module to determine which CPU has a higher assigned priority, and identifies the higher-priority CPU as the correctly-functioning CPU. In either case, the computer system then provides the data held in the FIFO associated with the correctly-functioning CPU to the peripheral devices. By thus buffering the data output streams, the present invention allows the computer system to utilize the diagnostic procedures for increasing the probability of correctly identifying a CPU as malfunctioning.


REFERENCES:
patent: 3192362 (1965-06-01), Cheney
patent: 3533065 (1970-10-01), McGilvray
patent: 3533082 (1970-10-01), Schnabel
patent: 3544973 (1970-12-01), Borck et al.
patent: 3548176 (1970-12-01), Shutler
patent: 3593307 (1971-07-01), Gouge, Jr.
patent: 3641505 (1972-02-01), Artz et al.
patent: 3665173 (1972-05-01), Bouricius et al.
patent: 3681578 (1972-08-01), Stevens
patent: 3688274 (1972-08-01), Cormier et al.
patent: 3710324 (1973-01-01), Cohen et al.
patent: 3736566 (1973-05-01), Anderson et al.
patent: 3783250 (1974-01-01), Fletcher et al.
patent: 3795901 (1974-03-01), Boehm et al.
patent: 3805039 (1974-04-01), Stiffler
patent: 3820079 (1974-06-01), Bergh et al.
patent: 3840861 (1974-10-01), Amdahl et al.
patent: 3879712 (1975-04-01), Edge et al.
patent: 3991407 (1976-11-01), Jordan, Jr. et al.
patent: 3997896 (1976-12-01), Cassarino, Jr. et al.
patent: 4015246 (1977-03-01), Hopkins, Jr. et al.
patent: 4030074 (1977-06-01), Giorcelli
patent: 4032893 (1977-06-01), Moran
patent: 4059736 (1977-11-01), Perucca et al.
patent: 4099234 (1978-07-01), Woods et al.
patent: 4176258 (1979-11-01), Jackson
patent: 4228496 (1980-10-01), Katzman et al.
patent: 4245344 (1981-01-01), Richter
patent: 4263649 (1981-04-01), Lapp, Jr.
patent: 4275440 (1981-06-01), Adams, Jr. et al.
patent: 4309754 (1982-01-01), Dinwiddie, Jr.
patent: 4323966 (1982-04-01), Whiteside et al.
patent: 4356550 (1982-10-01), Katzman et al.
patent: 4358823 (1982-11-01), McDonald et al.
patent: 4366535 (1982-12-01), Cedolin et al.
patent: 4369494 (1983-01-01), Bienvenu et al.
patent: 4375683 (1983-03-01), Wensley
patent: 4434463 (1984-02-01), Quinquis et al.
patent: 4449182 (1984-05-01), Rubinson et al.
patent: 4453215 (1984-06-01), Reid
patent: 4467436 (1984-08-01), Chance et al.
patent: 4484273 (1984-11-01), Stiffler et al.
patent: 4486826 (1984-12-01), Wolff et al.
patent: 4503496 (1985-03-01), Holzner et al.
patent: 4503535 (1985-03-01), Budde et al.
patent: 4507784 (1985-03-01), Procter
patent: 4543628 (1985-09-01), Pomfret
patent: 4562575 (1985-12-01), Townsend
patent: 4583224 (1986-04-01), Ishii et al.
patent: 4589066 (1986-05-01), Lam et al.
patent: 4590554 (1986-05-01), Glazer et al.
patent: 4597084 (1986-06-01), Dynneson et al.
patent: 4608631 (1986-08-01), Stiffler et al.
patent: 4610013 (1986-09-01), Long et al.
patent: 4622667 (1986-11-01), Yount
patent: 4628447 (1986-12-01), Cartret et al.
patent: 4630193 (1986-12-01), Kris
patent: 4633394 (1986-12-01), Georgiou et al.
patent: 4633467 (1986-12-01), Abel et al.
patent: 4644498 (1987-02-01), Bedard et al.
patent: 4648031 (1987-03-01), Jenner
patent: 4654846 (1987-03-01), Goodwin et al.
patent: 4654857 (1987-03-01), Samson et al.
patent: 4669056 (1987-05-01), Waldecker et al.
patent: 4669079 (1987-05-01), Blum
patent: 4686677 (1987-08-01), Flora
patent: 4700292 (1987-10-01), Campanini
patent: 4703420 (1987-10-01), Irwin
patent: 4736377 (1988-04-01), Bradley et al.
patent: 4739498 (1988-04-01), Eichhorn
patent: 4750177 (1988-06-01), Hendrie et al.
patent: 4799140 (1989-01-01), Dietz et al.
patent: 4805091 (1989-02-01), Thiel et al.
patent: 4809169 (1989-02-01), Sfarti et al.
patent: 4816990 (1989-03-01), Williams
patent: 4827409 (1989-05-01), Dickson
patent: 4866604 (1989-09-01), Reid
patent: 4872106 (1989-10-01), Slater
patent: 4905181 (1990-02-01), Gregory
patent: 4907232 (1990-03-01), Harper et al.
patent: 4914580 (1990-04-01), Jensen et al.
patent: 4916695 (1990-

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Fault-tolerant computer system with voter delay buffer does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Fault-tolerant computer system with voter delay buffer, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Fault-tolerant computer system with voter delay buffer will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3304600

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.