Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
2007-09-25
2007-09-25
Beausoliel, Robert (Department: 2113)
C714S013000, C714S015000, C714S038110, C714S049000
active
10836538
ABSTRACT:
A method of restoring processes within a process domain begins with a step of restoring a tree of processes in which at least two of the processes share at least a resource. The method continues with a step of restoring a checkpoint state of each resource used by the processes after a time when a possible need for a restoration state of the resource exists. According to an embodiment, the restoration state comprises information used by the method during the step of restoring the tree of processes. According to another embodiment, the restoration state comprises information used by the method during the step of restoring the checkpoint state of one or more particular resources. The method concludes with a step of resuming execution of each process after restoration of the checkpoint state of the resources used by the process.
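The ordering described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The names `Resource`, `Process`, and `restore_domain`, and the string placeholders for state, are hypothetical and chosen only for illustration. The sketch assumes a shared resource may carry temporary restoration state while the process tree is rebuilt, and that its checkpoint state is written back only once that restoration state is no longer needed.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    """Hypothetical stand-in for a shared resource such as a pipe or file."""
    name: str
    checkpoint_state: str          # state captured at checkpoint time
    state: str = "uninitialized"   # current state during restore


@dataclass
class Process:
    """Hypothetical node in the process tree being restored."""
    pid: int
    resources: list
    children: list = field(default_factory=list)
    running: bool = False


def restore_domain(root, log):
    """Restore a process tree in three phases and record each action in log."""
    # Phase 1: recreate the tree. Resources hold restoration state here,
    # which this sketch assumes the restore procedure itself may rely on.
    procs, stack = [], [root]
    while stack:
        proc = stack.pop()
        procs.append(proc)
        log.append(("create", proc.pid))
        for res in proc.resources:
            res.state = "restoration"
        stack.extend(reversed(proc.children))

    # Phase 2: the restoration state is no longer needed, so write back the
    # checkpoint state of each resource exactly once, even if it is shared.
    seen = set()
    for proc in procs:
        for res in proc.resources:
            if id(res) not in seen:
                seen.add(id(res))
                res.state = res.checkpoint_state
                log.append(("restore", res.name))

    # Phase 3: resume each process only after its resources are restored.
    for proc in procs:
        proc.running = True
        log.append(("resume", proc.pid))
    return procs
```

In this sketch a parent and two children sharing one pipe would produce three `create` entries, a single `restore` entry for the pipe, and then three `resume` entries, which mirrors the order of steps in the abstract.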
REFERENCES:
patent: 6338147 (2002-01-01), Meth et al.
patent: 6594779 (2003-07-01), Chandra et al.
patent: 7117354 (2006-10-01), Browning et al.
patent: 2002/0087916 (2002-07-01), Meth
Bouteiller, A., et al., Coordinated checkpoint versus message log for fault tolerant MPI, Dec. 2003.
Duell, J., The Design and Implementation of Berkeley Lab's Linux Checkpoint/Restart, 2003.
Litzkow, M., et al., Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System, 1997.
Osman, S., et al., The Design and Implementation of Zap: A System for Migrating Computing Environments, Proc. OSDI 2002, Dec. 2002.
Plank, J.S., et al., Libckpt: Transparent Checkpointing under Unix, <http://www.cs.utk.edu/plank/plank/papers/USENIX-95W.html>, 1995.
Plank, J.S., An Overview of Checkpointing in Uniprocessor and Distributed Systems, Focusing on Implementation and Performance, Tech. Report UT-CS-97-372, Univ. of Tenn., Knoxville, Tenn., Jul. 1997.
Stellner, G., CoCheck: Checkpointing and Process Migration for MPI, 1996.
Youhui, Z., et al., Checkpointing and Migration of parallel processes based on Message Passing Interface, Oct. 2002.
Zhong, H., et al., CRAK: Linux Checkpoint/Restart As a Kernel Module, Technical Report CUCS-014-01, <http://www.ncl.cs.columbia/research/migrate/crak.html>, Nov. 2001.
Janakiraman Gopalakrishnan
Lowell David E.
Santos Jose Renato
Subhraveti Dinesh Kumar
Turner Yoshio Frank
Beausoliel Robert
Ehne Charles
Hewlett-Packard Development Company, L.P.
Lange Richard P.