Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
2006-08-22
2009-06-16
Beausoliel, Robert (Department: 2113)
C370S216000
active
07549090
ABSTRACT:
An apparatus, program product, and method propagate errors detected in an IO fabric element from an IO fabric that is used to couple a plurality of endpoint IO resources to processing elements in a computer. In particular, such errors are propagated to the endpoint IO resources affected by the IO fabric element in connection with recovering from the errors in the IO fabric element. As a result, a device driver or other program code used to access each affected IO resource may be permitted to recover asynchronously from the propagated error in its associated IO resource, often without requiring recovery from the error in the IO fabric element to wait until recovery has completed for each of the affected IO resources. In addition, an IO fabric may be dynamically configured to support both recoverable and non-recoverable endpoint IO resources. In particular, IO fabric elements within an IO fabric may be dynamically configured to enable machine check signaling in response to detecting that an endpoint IO resource is non-recoverable in nature. The IO fabric elements configured in this way are those disposed within the hardware path defined between the non-recoverable resource and a processor that accesses the non-recoverable resource.
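The two mechanisms described in the abstract can be illustrated with a small sketch. It is a toy model under stated assumptions, not the patented implementation: the `Node` class, the function names, the `error_pending` and `machine_check` flags, and the example topology are all hypothetical, and the fabric is assumed to be a simple tree whose root stands in for the processor side.

```python
class Node:
    """Hypothetical model of a fabric element or, if is_endpoint, an endpoint IO resource."""

    def __init__(self, name, is_endpoint=False, recoverable=True):
        self.name = name
        self.is_endpoint = is_endpoint
        self.recoverable = recoverable   # meaningful for endpoints only
        self.parent = None
        self.children = []
        self.machine_check = False       # meaningful for fabric elements only
        self.error_pending = False

    def add(self, child):
        child.parent = self
        self.children.append(child)
        return child


def endpoints_below(element):
    """Yield every endpoint reachable through the given fabric element."""
    for child in element.children:
        if child.is_endpoint:
            yield child
        else:
            yield from endpoints_below(child)


def recover_fabric_error(element):
    """Propagate the error to each affected endpoint, then clear the element.

    The fabric element does not wait for the endpoints: each driver is
    expected to notice error_pending later and recover on its own schedule.
    """
    affected = list(endpoints_below(element))
    for endpoint in affected:
        endpoint.error_pending = True
    element.error_pending = False
    return [endpoint.name for endpoint in affected]


def driver_recover(endpoint):
    """Stand-in for a device driver's later, asynchronous recovery."""
    endpoint.error_pending = False


def configure_machine_checks(endpoint):
    """Enable machine checks on the path from a non-recoverable endpoint to the root."""
    enabled = []
    if endpoint.recoverable:
        return enabled
    node = endpoint.parent
    while node is not None:
        node.machine_check = True
        enabled.append(node.name)
        node = node.parent
    return enabled


# Assumed topology: root -> bridge_a -> {disk, nic}; root -> bridge_b -> legacy
root = Node("root")
bridge_a = root.add(Node("bridge_a"))
bridge_b = root.add(Node("bridge_b"))
disk = bridge_a.add(Node("disk", is_endpoint=True))
nic = bridge_a.add(Node("nic", is_endpoint=True))
legacy = bridge_b.add(Node("legacy", is_endpoint=True, recoverable=False))

bridge_a.error_pending = True
affected = recover_fabric_error(bridge_a)    # flags disk and nic, clears bridge_a
mc_path = configure_machine_checks(legacy)   # enables bridge_b and root only
```

In this sketch, `bridge_a` finishes its own recovery while `disk` and `nic` still carry a pending error, which mirrors the asynchronous recovery described in the abstract. Machine check signaling is enabled only on the elements between `legacy` and the root, so `bridge_a` is left untouched.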
REFERENCES:
patent: 4672537 (1987-06-01), Katzman et al.
patent: 5394542 (1995-02-01), Frey et al.
patent: 5978938 (1999-11-01), Kaiser et al.
patent: 5991900 (1999-11-01), Garnett
patent: 6032271 (2000-02-01), Goodrum et al.
patent: 6643727 (2003-11-01), Arndt et al.
patent: 6829729 (2004-12-01), Hicks et al.
patent: 6901537 (2005-05-01), Dawkins et al.
patent: 6934888 (2005-08-01), McIntosh et al.
patent: 6976191 (2005-12-01), Kitamorn et al.
patent: 2002/0184576 (2002-12-01), Arndt et al.
“Operation and Use, vol. 1 Using the Parallel Operating Environment” by IBM, obtained Jan. 6, 2006 from “http://www.nersc.gov/vendor_docs/ibm/pe/am102mst12.html”.
Bailey David Alan
Nguyen Trung Ngoc
Nordstrom Gregory Michael
Patel Kanisha
Thurber Steven Mark
Beausoliel Robert
Guyton Philip
International Business Machines - Corporation
Wood Herron & Evans LLP