Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Patent
1997-06-09
2000-06-06
Baker, Stephen M.
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
G06F 1120, G06F 1116
Patent
active
060732512
ABSTRACT:
A computer system in a fault-tolerant configuration employs multiple identical CPUs executing the same instruction stream, with multiple, identical memory modules in the address space of the CPUs storing duplicates of the same data. The system detects faults in the CPUs and memory modules, and places a faulty unit offline while continuing to operate using the good units. The faulty unit can be replaced and reintegrated into the system without shutdown. The multiple CPUs are loosely synchronized, as by detecting events such as memory references and stalling any CPU ahead of others until all execute the function simultaneously; interrupts can be synchronized by ensuring that all CPUs implement the interrupt at the same point in their instruction stream. Memory references via the separate CPU-to-memory busses are voted at the three separate ports of each of the memory modules. I/O functions are implemented using two identical I/O busses, each of which is separately coupled to only one of the memory modules. A number of I/O processors are coupled to both I/O busses. I/O devices are accessed through a pair of identical (redundant) processors, but only one is designated to actively control a given device; in case of failure of one I/O processor, however, an I/O device can be accessed by the other one without system shutdown.
REFERENCES:
patent: 3602900 (1971-08-01), Delaigue et al.
patent: 3665173 (1972-05-01), Bouricius et al.
patent: 3681578 (1972-08-01), Stevens
patent: 3735356 (1973-05-01), Yates
patent: 3737870 (1973-06-01), Carter et al.
patent: 3760364 (1973-09-01), Yamauchi et al.
patent: 3761884 (1973-09-01), Avsan et al.
patent: 3810119 (1974-05-01), Zieve et al.
patent: 3828321 (1974-08-01), Wilber et al.
patent: 3833798 (1974-09-01), Schaffer
patent: 3848116 (1974-11-01), Moder et al.
patent: 3864670 (1975-02-01), Inoue et al.
patent: 3921149 (1975-11-01), Kreis et al.
patent: 4015243 (1977-03-01), Kurpanek et al.
patent: 4015246 (1977-03-01), Hopkins, Jr. et al.
patent: 4034347 (1977-07-01), Probert, Jr.
patent: 4187538 (1980-02-01), Douglas et al.
patent: 4204249 (1980-05-01), Dye et al.
patent: 4224664 (1980-09-01), Trinchieri
patent: 4228496 (1980-10-01), Katzman et al.
patent: 4234920 (1980-11-01), Van Ness et al.
patent: 4253144 (1981-02-01), Bellamy et al.
patent: 4257097 (1981-03-01), Moran
patent: 4315310 (1982-02-01), Bayliss et al.
patent: 4316245 (1982-02-01), Luu et al.
patent: 4321666 (1982-03-01), Tasar et al.
patent: 4330826 (1982-05-01), Whiteside et al.
patent: 4342084 (1982-07-01), Sager et al.
patent: 4358823 (1982-11-01), McDonald et al.
patent: 4366535 (1982-12-01), Cedolin et al.
patent: 4375683 (1983-03-01), Wensley
patent: 4380046 (1983-04-01), Fung
patent: 4392196 (1983-07-01), Glenn et al.
patent: 4392199 (1983-07-01), Schmitter et al.
patent: 4399504 (1983-08-01), Obermarck et al.
patent: 4402045 (1983-08-01), Krol
patent: 4412218 (1983-10-01), Niitsu
patent: 4412281 (1983-10-01), Works
patent: 4414624 (1983-11-01), Summer, Jr. et al.
patent: 4426681 (1984-01-01), Bacot et al.
patent: 4430707 (1984-02-01), Kim
patent: 4432051 (1984-02-01), Bogaert et al.
patent: 4438494 (1984-03-01), Budde et al.
patent: 4453215 (1984-06-01), Reid
patent: 4455605 (1984-06-01), Cormier et al.
patent: 4458307 (1984-07-01), McAnlis et al.
patent: 4493019 (1985-01-01), Kim et al.
patent: 4497059 (1985-01-01), Smith
patent: 4541094 (1985-09-01), Stiffler et al.
patent: 4564903 (1986-01-01), Guyette et al.
patent: 4570261 (1986-02-01), Maher
patent: 4577272 (1986-03-01), Ballew et al.
patent: 4589066 (1986-05-01), Lam et al.
patent: 4591977 (1986-05-01), Nissen et al.
patent: 4597084 (1986-06-01), Dynneson et al.
patent: 4607365 (1986-08-01), Greig et al.
patent: 4608688 (1986-08-01), Hansen et al.
patent: 4616312 (1986-10-01), Uebel
patent: 4633394 (1986-12-01), Georgiou et al.
patent: 4638427 (1987-01-01), Martin
patent: 4644498 (1987-02-01), Bedard et al.
patent: 4646231 (1987-02-01), Green et al.
patent: 4648035 (1987-03-01), Fava et al.
patent: 4654857 (1987-03-01), Samson et al.
patent: 4661900 (1987-04-01), Chen et al.
patent: 4667287 (1987-05-01), Allen et al.
patent: 4672535 (1987-06-01), Katzman et al.
patent: 4683570 (1987-07-01), Bedard et al.
patent: 4703452 (1987-10-01), Abrant et al.
patent: 4709325 (1987-11-01), Yajima
patent: 4733353 (1988-03-01), Jaswa
patent: 4751639 (1988-06-01), Corcoran et al.
patent: 4754397 (1988-06-01), Varaga et al.
patent: 4757442 (1988-07-01), Sakata
patent: 4757505 (1988-07-01), Marrington et al.
patent: 4763333 (1988-08-01), Byrd
patent: 4774709 (1988-09-01), Tulplue et al.
patent: 4779008 (1988-10-01), Kessels
patent: 4783731 (1988-11-01), Miyzaki et al.
patent: 4783733 (1988-11-01), Greig et al.
patent: 4785453 (1988-11-01), Chandran et al.
patent: 4794601 (1988-12-01), Kikuchi
patent: 4799140 (1989-01-01), Dietz et al.
patent: 4800462 (1989-01-01), Zacher et al.
patent: 4805107 (1989-02-01), Kieckhafer et al.
patent: 4819159 (1989-04-01), Shipley et al.
patent: 4823256 (1989-04-01), Bishop et al.
patent: 4827401 (1989-05-01), Hrustich et al.
patent: 4831520 (1989-05-01), Rubinfeld et al.
patent: 4845419 (1989-07-01), Hacker
patent: 4847837 (1989-07-01), Morales et al.
patent: 4849979 (1989-07-01), Maccianti et al.
patent: 4853872 (1989-08-01), Shimoi
patent: 4860333 (1989-08-01), Bitzinger et al.
patent: 4868818 (1989-09-01), Madan et al.
patent: 4868826 (1989-09-01), Smith et al.
patent: 4868832 (1989-09-01), Marrington et al.
patent: 4873685 (1989-10-01), Millis, Jr.
patent: 4879971 (1989-11-01), McNally et al.
patent: 4907232 (1990-03-01), Harper et al.
patent: 4912698 (1990-03-01), Bitzinger et al.
patent: 4914657 (1990-04-01), Walter et al.
patent: 4933940 (1990-06-01), Walter et al.
patent: 4937741 (1990-06-01), Harper et al.
patent: 4959774 (1990-09-01), Davis
patent: 4965717 (1990-10-01), Cutts, Jr. et al.
patent: 5018148 (1991-05-01), Patel et al.
patent: 5020059 (1991-05-01), Gorin et al.
patent: 5090014 (1992-02-01), Polich et al.
patent: 5126889 (1992-06-01), Walden
Dishon et al., "Disk Dual Copy Methods and their Performance", IEEE Fault Tolerant Computing Symposium, 1988, pp. 314-319, Dec. 1988.
Bitton, D., "Arm Scheduling in Shadowed Disks", Digest of Papers, Spring COMPCOM 89, pp. 132-136, Mar. 1989.
Malaiya, Y., "Fault-Tolerance in Multiple Processor Systems", Proc. IEEE Intl. Conf. on Circuits and Computers., Oct. 1-3, 1980, Chester, NY, pp. 710-716.
Wensley, J. "Fault-Tolerant Computers Ensure Reliable Industrial Controls", Electronic Design, Jun. 25, 1981, pp. 129-135.
Boggs, D., "Fault-Tolerance Computer Enhances Control System Reliability", Control Engineering, Sep. 1981, pp. 129-132.
Wensley, J. "Industrial-Control System Does Things in Threes for Safety", Electronic, Jan. 27, 1983, pp. 98-102.
Yoneda, T. et al., "Implementation of Interrupt Handler for Loosely-Synchronized TMR Systems", 15th Am. Symp. on Fault-Tolerant Computing Jun. 1985, pp. 246-251.
Bates, K. et al., "Shadowing Boosts System Reliability" Computer Design, Apr. 1985, pp. 129-137.
Kirrmann, H. "Fault-Tolerance in Process Control: An Overview Examples of European Products", IEEE Micro Oct. 1987, pp. 27-50.
Chester, M. "Fault-Tolerant Computers Mature", Systems & Software, Mar. 1985, pp. 117-129.
"Fault Tolerant Systems Can Prevent Timing Problems" by John H. Wensley Computer Design, vol. 21(1982) No. 11, Winchester, Massachusetts.
"Comparative Architecture of High-Availability Computer Systems" by McClusky & Ogus; Inst. of Electrical & Electronics Engineers, Spring Conf. 14; 1977.
"Architectural Description of a Fault-Tolerant Multiprocessor Engineering Prototype" by Smith and Hopkins; 8th Ann. Int'l. Conf. on Fault-Tolerant Computing Jun., 1978.
E.I. Cohen et al. "Storage Hierarchies", 1989, IBM Systems Journal, vol. 28, No. 1, pp. 62-76.
Davies & Wakerly, Synchronization and Matching Redundant Systems, IEEE Trans. on Computers, Jun. 1978, pp. 531-539.
Yoneda et al, "Implementation of Interrupt Handler for Loosley-Synchronized TMR Systems", Proc. 15.sup.th Ann. Symp. on Fault-Tolerant Computing, Jun. '85, p. 2.
McConnel & Sie
Banton Randall G.
Bereiter Tom
Cutts, Jr. Richard W.
Debacker Kenneth C.
Fey, Jr. Krayn W.
Baker Stephen M.
Compaq Computer Corporation
LandOfFree
Fault-tolerant computer system with online recovery and reintegr does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Fault-tolerant computer system with online recovery and reintegr, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Fault-tolerant computer system with online recovery and reintegr will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2224226