Automatic failure detection and recovery of applications

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C714S004110, C714S010000, C714S015000, C714S039000

Reexamination Certificate

active

10090041

ABSTRACT:
A method and apparatus for executing target application programs with employment of a supervisor that is coupled to the executing target application through a conventional status reporting port of the computer. Events that represent execution errors or unexpected behavior are provided to the reporting port, or discovered through the reporting port, and the supervisor responds to such events by perusing a configuration that is tailored to the executing target application. For each reported event, the supervisor retrieves one or more actions that the configuration file specifies for the reported events and executes the specified actions. The configuration file includes actions that aim to effect a recovery, from reported events, including restart application, or a thread of the application, based on checkpointed data, restart target application as soon as it becomes idle, and suspend a thread when a thread starvation condition exists.

REFERENCES:
patent: 6282701 (2001-08-01), Wygodny et al.
patent: 6701454 (2004-03-01), Fischer et al.
patent: 6745350 (2004-06-01), Cline et al.
patent: 6757897 (2004-06-01), Shi et al.
Klemm et al, Enhancing JAVA Server Availability with JAS,Softw. Pract. Exper.2001:31:697-716 16, Mar. 2001.
Wang et al, Checkpointing and Its Applications,Proc. IEEE Fault-Tolerant Computing Symp.(FTCS-25), pp. 22-31, Jun. 1995.
Marzullo et al, Tools for Constructing Distributed Reactive Systems,DARPA, NASA Ames grant No. NAG-593, Contract N00140-87-C-8904, Feb. 22, 1991.
Plattner, Real-Time Execution Monitoring,IEEE Transactions on Software Engineering, vol. SE-10, No. 6, pp. 756-764, Nov. 1984.
Huang et al, NT-Swift: Software Implemented Fault Tolerance in Windows NT,Proceedings of the 2nd USENIX Windows NT Symposium, Seattle, WA, Aug. 3-4, 1998.
Dimaio et al, Execution Monitoring and Debugging Tool for ADA Using Relational Algebra,Proceedings of the ADA International Conference in Use():109-123 1985.
Atkinson et al, A Review of the Rationale and Architectures of PJama: a Durable, Flexible, Evolvable and Scalable Orthogonally Persistent Programming Platform, Sun Microsystems Laboratories, M/S MTV29-112, Jun. 2000.
Cristian, F., “Exception Handling and Tolerance of Software Faults”,Software Fault Tolerance, Chapter 4, Edited By Lyu, 1995, John Wiley & Sons, Ltd.
Nicola, Victor E., “Checkpointing and the Modeling of Program Execution Time”, Software Fault Tolerance, Chapter 7, 1995.
Huang, Y., et al., “Software Fault Tolerance in the Application Layer”, Software Fault Tolerance in th Application Layer, Chapter 10, 1995.
Jane-Ferng Chiu and Ge-Ming Chiu Process-Replication Technique for Fault-Tolerance and Performance Improvement in Distributed Computing Systems, in Proceedings of the Third International Symposium on High Performance Distributed Computing, 1994.
Ganesha Beedubail et al., An Architecture for Object Replication in Distributed Systems, Department of Computer Science, Texas A&M University, College Station, TX 77843, Technical Report (TR-96-006), Mar. 1996.
Huw Evans “Why Object Serialization is Inappropriate for Providing Persistence in Java.” Department of Computer Science, The University of Glasgow, 2000.
Malcolm Atkinson “Persistence and Java A Balancing Act.” Lecture Notes in Computer Science, vol. 1944, 2001, pp. 1 ff., Springer Verlag.
James S. Plank “An Overview of Checkpointing in Uniprocessor and Distributed Systems, Focusing on Implementation and Performance,” Department of Computer Science, University of Tennessee, Report No. UT-CS-97-372, Jul. 1997.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Automatic failure detection and recovery of applications does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Automatic failure detection and recovery of applications, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automatic failure detection and recovery of applications will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3755406

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.