Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
2007-07-10
2007-07-10
Beausoliel, Robert (Department: 2113)
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
C714S004110, C714S010000, C714S015000, C714S039000
Reexamination Certificate
active
10090041
ABSTRACT:
A method and apparatus for executing target application programs with employment of a supervisor that is coupled to the executing target application through a conventional status reporting port of the computer. Events that represent execution errors or unexpected behavior are provided to the reporting port, or discovered through the reporting port, and the supervisor responds to such events by perusing a configuration that is tailored to the executing target application. For each reported event, the supervisor retrieves one or more actions that the configuration file specifies for the reported events and executes the specified actions. The configuration file includes actions that aim to effect a recovery, from reported events, including restart application, or a thread of the application, based on checkpointed data, restart target application as soon as it becomes idle, and suspend a thread when a thread starvation condition exists.
REFERENCES:
patent: 6282701 (2001-08-01), Wygodny et al.
patent: 6701454 (2004-03-01), Fischer et al.
patent: 6745350 (2004-06-01), Cline et al.
patent: 6757897 (2004-06-01), Shi et al.
Klemm et al, Enhancing JAVA Server Availability with JAS,Softw. Pract. Exper.2001:31:697-716 16, Mar. 2001.
Wang et al, Checkpointing and Its Applications,Proc. IEEE Fault-Tolerant Computing Symp.(FTCS-25), pp. 22-31, Jun. 1995.
Marzullo et al, Tools for Constructing Distributed Reactive Systems,DARPA, NASA Ames grant No. NAG-593, Contract N00140-87-C-8904, Feb. 22, 1991.
Plattner, Real-Time Execution Monitoring,IEEE Transactions on Software Engineering, vol. SE-10, No. 6, pp. 756-764, Nov. 1984.
Huang et al, NT-Swift: Software Implemented Fault Tolerance in Windows NT,Proceedings of the 2nd USENIX Windows NT Symposium, Seattle, WA, Aug. 3-4, 1998.
Dimaio et al, Execution Monitoring and Debugging Tool for ADA Using Relational Algebra,Proceedings of the ADA International Conference in Use():109-123 1985.
Atkinson et al, A Review of the Rationale and Architectures of PJama: a Durable, Flexible, Evolvable and Scalable Orthogonally Persistent Programming Platform, Sun Microsystems Laboratories, M/S MTV29-112, Jun. 2000.
Cristian, F., “Exception Handling and Tolerance of Software Faults”,Software Fault Tolerance, Chapter 4, Edited By Lyu, 1995, John Wiley & Sons, Ltd.
Nicola, Victor E., “Checkpointing and the Modeling of Program Execution Time”, Software Fault Tolerance, Chapter 7, 1995.
Huang, Y., et al., “Software Fault Tolerance in the Application Layer”, Software Fault Tolerance in th Application Layer, Chapter 10, 1995.
Jane-Ferng Chiu and Ge-Ming Chiu Process-Replication Technique for Fault-Tolerance and Performance Improvement in Distributed Computing Systems, in Proceedings of the Third International Symposium on High Performance Distributed Computing, 1994.
Ganesha Beedubail et al., An Architecture for Object Replication in Distributed Systems, Department of Computer Science, Texas A&M University, College Station, TX 77843, Technical Report (TR-96-006), Mar. 1996.
Huw Evans “Why Object Serialization is Inappropriate for Providing Persistence in Java.” Department of Computer Science, The University of Glasgow, 2000.
Malcolm Atkinson “Persistence and Java A Balancing Act.” Lecture Notes in Computer Science, vol. 1944, 2001, pp. 1 ff., Springer Verlag.
James S. Plank “An Overview of Checkpointing in Uniprocessor and Distributed Systems, Focusing on Implementation and Performance,” Department of Computer Science, University of Tennessee, Report No. UT-CS-97-372, Jul. 1997.
Klemm Reinhard P.
Singh Navjot
Avaya Technology LLC
Beausoliel Robert
Cohen Pontani Lieberman & Pavane LLP
Manoskey Joseph
LandOfFree
Automatic failure detection and recovery of applications does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Automatic failure detection and recovery of applications, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automatic failure detection and recovery of applications will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3755406