Reexamination Certificate
2007-03-13
Beausoliel, Robert (Department: 2113)
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
C714S004110, C714S042000, C714S006130, C709S205000, C718S105000
active
10324276
ABSTRACT:
A hybrid quorum/consensus and primary-backup fault-tolerance model in an object-based distributed data storage system. When a primary manager fails, a hierarchy of network entities is established in which a group of realm managers first authorizes a failure-handling event through quorum/consensus, and a backup manager (for the failing primary manager) then executes the decision of the quorum of realm managers. The realm managers, operating by consensus, determine (a) whether the primary manager can indeed be asserted to be down, and (b) whether there is a quorum of realm managers in agreement on this decision. If both are true, a master realm manager instructs the backup manager to execute the steps necessary to become the primary manager and to function as the primary until the original primary manager is brought back into service. The hybrid fault-tolerance approach handles both single-unit failures and network partitions in a unified way, without creating a single cluster out of the fault domain.
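The two-stage flow described in the abstract (realm managers authorize, the backup manager executes) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `Manager`, `quorum_agrees_primary_down`, and `master_handle_failure` are hypothetical, and the quorum rule used here (a strict majority of all realm managers) is an assumption, since the abstract does not say how a quorum is counted.

```python
from dataclasses import dataclass


@dataclass
class Manager:
    """A storage manager that acts as either the primary or its backup."""
    name: str
    role: str  # "primary" or "backup"


def quorum_agrees_primary_down(votes, total_realm_managers):
    """Return True if a strict majority of ALL realm managers voted 'down'.

    votes maps a realm-manager name to True when that manager judges the
    primary to be down. Realm managers cut off by a network partition
    simply have no entry. The strict-majority rule is an assumption.
    """
    down_votes = sum(1 for is_down in votes.values() if is_down)
    return down_votes > total_realm_managers // 2


def master_handle_failure(votes, total_realm_managers, backup):
    """Master realm manager: promote the backup only if the quorum agrees."""
    if not quorum_agrees_primary_down(votes, total_realm_managers):
        return False  # no quorum, so the failure-handling event is not authorized
    backup.role = "primary"  # the backup executes the quorum's decision
    return True


# Three of five realm managers are reachable and all judge the primary down.
backup = Manager(name="backup-1", role="backup")
reachable_votes = {"rm1": True, "rm2": True, "rm3": True}
print(master_handle_failure(reachable_votes, 5, backup), backup.role)
```

Under this assumed rule, the minority side of a partition can never collect enough votes to promote a backup, which is one way a single mechanism could cover both single-unit failures and network partitions.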
REFERENCES:
patent: 5845082 (1998-12-01), Murakami
patent: 5946686 (1999-08-01), Schmuck et al.
patent: 5948109 (1999-09-01), Moiin et al.
patent: 5956734 (1999-09-01), Schmuck et al.
patent: 5960446 (1999-09-01), Schmuck et al.
patent: 5987477 (1999-11-01), Schmuck et al.
patent: 6023706 (2000-02-01), Schmuck et al.
patent: 6233623 (2001-05-01), Jeffords et al.
patent: 6292905 (2001-09-01), Wallach et al.
patent: 6675199 (2004-01-01), Mohammed et al.
patent: 2003/0023680 (2003-01-01), Shirriff
patent: 2006/0036896 (2006-02-01), Gamache et al.
patent: 2006/0090095 (2006-04-01), Massa et al.
patent: 001107119 (2001-06-01), None
Fu, Ada Waichee, Delay-Optimal Quorum Consensus for Distributed Systems, IEEE Transactions on Parallel and Distributed Systems, vol. 8, No. 1, Jan. 1997.
Article by Leslie Lamport entitled “The Part-Time Parliament,” pp. i-vi, 1-42, Digital Equipment Corporation, Sep. 1, 1989.
Article by Garth A. Gibson et al. entitled “A Cost-Effective, High-Bandwidth Storage Architecture,” pp. 92-103, Association for Computing Machinery, 1998.
Article by Andreas Dilger & Peter J. Braam entitled “Object Based Storage HOWTO,” pp. 1-13, v. 1.2, Dec. 23, 1999, available at http://www.lustre.org/docs.
Article by Garth A. Gibson and Rodney Van Meter entitled “Network Attached Storage Architecture,” pp. 37-45, Communications of the ACM, Nov. 2000, vol. 43, No. 11.
Gibson Garth A.
Holland Mark C.
Zelenka James D.
Beausoliel Robert
McCarthy Christopher
Morgan, Lewis & Bockius LLP
Panasas, Inc.