Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability
Reexamination Certificate
2009-05-28
2011-12-13
Ehne, Charles (Department: 2113)
Error detection/correction and fault detection/recovery
Data processing system error or fault handling
Reliability and availability
C714S005100, C714S047100, C714S047300, C702S185000, C702S186000
Reexamination Certificate
active
08078913
ABSTRACT:
Methods for automatically identifying and classifying a crisis state occurring in a system having a plurality of computer resources. Signals are received from a device that collects the signals from each computer resource in the system. For each epoch, an epoch fingerprint is generated. Upon detecting a performance crisis within the system, a crisis fingerprint is generated consisting of at least one epoch fingerprint. The technology is able to identify that a performance crisis has previously occurred within the datacenter if a generated crisis fingerprint favorably matches any of the model crisis fingerprints stored in a database. The technology may also predict that a crisis is about to occur.
REFERENCES:
patent: 6327677 (2001-12-01), Garg et al.
patent: 6748416 (2004-06-01), Carpenter et al.
patent: 7184935 (2007-02-01), Cohen et al.
patent: 7801703 (2010-09-01), Marvasti
patent: 2003/0110007 (2003-06-01), McGee et al.
patent: 2006/0188011 (2006-08-01), Goldszmidt et al.
patent: 2008/0126881 (2008-05-01), Bruckhaus
patent: 2008/0235283 (2008-09-01), Turnball et al.
patent: 2008/0250497 (2008-10-01), Mullarkey et al.
patent: 2010/0223499 (2010-09-01), Panigrahy et al.
Gabriela Cretu, “Hunting for Problems with Artemis”, Microsoft Research, http://www.usenix.org/events/wasI08/tech/full—papers/cretu/cretu—html/.
Peter Bodik, “HiLighter: Automatically Building Robust Signatures of Performance Behavior for Small- and Large-Scale Systems”, Nov. 22, 2008, RAD Lab, EECS Department, UC Berkeley, Microsoft Research, Silicon Valley, http://www.usenix.org/events/sysmI08/tech/full—papers/bodik/bodik—html/.
Tarek F. Abdelzaher, “Performance Guarantees for Web Server End-Systems: A Control Theoretical Approach”, IEEE Transactions on Parallel and Distributed Systems, 2002, http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.16.8551&rep=rep1&type=pdf, IEEE Press, Piscataway, NJ, USA.
Marcos K. Aguilera, “Performance Debugging for Distributed Systems of Black Boxes”, SOSP, Oct. 19-22, 2003, Bolton Landing, NY, USA.
Guillermo A. Alvarez, “Minerva: An Automated Resource Provisioning Tool for Large-Scale Storage Systems”, ACM Transactions on Computer Systems (TOCS), Nov. 2001, vol. 19, Issue 4, pp. 483-518, ACM, New York, NY, USA.
Paul Barham, “Magpie: Online Modelling and Performance-Aware Systems”, Proceedings of the 9th conference on Hot Topics in Operating Systems , 2003, vol. 9, USENIX Association Berkeley, CA, USA.
Bayesian Network Classifier Toolbox, jBNC Toolkit, website, http://jbnc.sourceforge.net/.
John Binder, “Adaptive Probabilistic Networks with Hidden Variables”, revised Mar. 24, 1997, http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.53.3318&rep=rep1&type=pdf.
Anna Bartkowiak, “Neural Networks for Pattern Recognition”, Lecture Notes, Jan. 26, 2005, pp. 1-56, Institute of Computer Science, University of Wroclaw, http://www.ii.uni.wroc.pl/˜aba/teach/roadmap.pdf.
Mike Y. Chen, “Pinpoint: Problem Determination in Large, Dynamic Systems”, Power Point. http://www.cs.rutgers.edu/˜ryder/oosem05/lectures/Pinpoint-Ophelia.pdf.
Mike Y. Chen, “Path-Based Failure and Evolution Management”, Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation, 2004, pp. 23-23, USENIX Association Berkeley, CA, USA.
David D. Clark, “A Knowledge Plane for the Internet”, SIGCOMM, Aug. 25-29, 2003, ACM, Karlsruhe, Germany.
Carl Czajkowski, “Grid Information Services for Distributed Resource Sharing”, Proceedings of the 10th IEEE International Symposium on High-Performance Distributed Computing, 2001, p. 181, IEEE Computer Society Washington, DC, USA.
Ronald P. Doyle, “Model-Based Resource Provisioning in a Web Service Utility”, Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems, 2003, p. 5, USENIX Association Berkeley, CA, USA.
Richard O. Duda, “Pattern Classification and Scene Analysis”, 1973, pp. 482, Wiley-Interscience, New York, NY, USA, http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1100577&isnumber=24138.
Armando Fox, “Self-Repairing Computers”, Scientific American Magazine, Jun. 2003, 8 pages, http://www.sciamdigital.com/index.cfm?fa=Products.ViewIssuePreview&ARTICLEID—CHAR=08B277E7-FBF6-D7D6-79555B73B319013F.
Nir Friedman, “Bayesian Network Classifiers”, Machine Learning, 1997, pp. 131-163, vol. 29, Issue 2-3, Kluwer Academic Publishers Hingham, MA, USA.
Nir Friedman, “Sequential Update of Bayesian Network Structure”, Proceedings of the 13th Conference on Uncertainty in Artificial Intelligence, 1997, http://www.cs.huji.ac.il/˜nir/Papers/FrG4.pdf.
Pankaj K. Garg, “Web Transaction Analysis and Optimization”,Software Technology Laboratory, HP Laboratories Palo Alto, Feb. 28, 2002, http://www.hpl.hp.com/techreports/2002/HPL-2002-45.pdf.
Trevor Hastie, “The Elements of Statistical Learning”, http://mail.ustc.edu.cn/˜fygood/papers/ms/Statistical%20Learning%20for%20Statisticians.pdf.
David Heckerman, “Learning Bayesian Networks: The Combination of Knowledge and Statistical Data”, Technical Report, MSR-TR-94-09, Mar. 1994, pp. 1-53, http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.30.7910&rep=rep1&type=pdf.
Ryan Huebsch, “Querying the Internet with PIER”, Proceedings of the 29th VLDB Conference, 2003, 12 pages, Berlin, Germany.
Ross Ihaka, “R: A Language for Data Analysis and Graphics”, Journal of Computational and Graphical Statistics, 1996, pp. 299-314, vol. 5, No. 3.
Jeffrey O. Kephart, “The Vision of Autonomic Computing”, IEEE, 2003, pp. 41-50, vol. 36, Issue 1, IEEE Computer Society Press Los Alamitos, CA, USA.
Michael Mesnier, “File Classification in Self-Storage Systems”, Proceedings of the 1st International Conference on Autonomic Computing, May 2004, 8 pages, New York, NY, USA.
David Mosberger, “httperf—A Tool for Measuring Web Server Performance”, ACM SIGMETRICS Performance Evaluation Review, Dec. 1998, pp. 31-37, vol. 26, Issue 3, ACM New York, NY, USA.
J. Ross Quinlan, “C4.5 Programs for Machine Learning”, Machine Learning, 1993, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
David Gerard Sullivan, “Using Probabilistic Reasoning to Automate Software Tuning”, Sep. 2003, 212 pages, Harvard University Cambridge, MA, USA.
Robbert Van Renesse, “Astrolabe: A Robust and Scalable Technology for Distributed System Monitoring, Management, and Data Mining”, ACM Transactions on Computer Systems, 2003, pp. 164-206, vol. 21, Issue 2, ACM New York, NY, USA.
Mike Wawrzoniak, “Sophia: An Information Plane for Networked Systems”, Planet Lab, ACM SIGCOMM Computer Communication Review, 2004, pp. 15-20, vol. 34, Issue 1, ACM New York, NY, USA.
Bodik Peter
Goldszmidt Moises
Ehne Charles
Microsoft Corporation
Vierra Magen Marcus & DeNiro LLP
LandOfFree
Automated identification of performance crisis does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Automated identification of performance crisis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automated identification of performance crisis will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4313354