System and method for continuous diagnosis of data streams

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000

Reexamination Certificate

active

07464068

ABSTRACT:
In connection with the mining of time-evolving data streams, a general framework that mines changes and reconstructs models from a data stream with unlabeled instances or a limited number of labeled instances. In particular, there are defined herein statistical profiling methods that extend a classification tree in order to guess the percentage of drifts in the data stream without any labelled data. Exact error can be estimated by actively sampling a small number of true labels. If the estimated error is significantly higher than empirical expectations, there preferably re-sampled a small number of true labels to reconstruct the decision tree from the leaf node level.

REFERENCES:
patent: 6442561 (2002-08-01), Gehrke et al.
patent: 6938049 (2005-08-01), Kamath et al.
patent: 2002/0082886 (2002-06-01), Manganaris et al.
patent: 2003/0220860 (2003-11-01), Heytens et al.
patent: 2005/0091524 (2005-04-01), Abe et al.
Aggarwal, C.C., A framework for diagnosing changes in evolving data streams, In Proceedings of ACM SIGMO 2003, pp. 575-586, 2003.
Babcock et al., Models and Issues in Dataq Stream Systems, In ACM Symposium on Principles of Database Systems (PODS), 2002.
Babu et al., Continuous queries over data streams, SIGMOD Record, 30:109-120, 2001.
Chen et al., Multi-dimensional regression analysis of time-series data streams, In Proc. of Very Large Database (VLDB), Hongkong, China, 2002.
Domingos et al., Mining high-speed data streams, In Int'l Conf. ofn Knowledge Discovery and Data Mining (SIGKDD), pp. 71-80, Boston, MA, 2000, ACM Press.
Gao et al., Continually evaluating similarity-based pattern queries on a streaming time series, In Int'l Conf. Management of Data (SIGMOD), Madison, WI, Jun. 2002.
Gehrke et al., Boat—Optimistic decision tree construction, In Int'l Conf. Management of Data (SIGMOD), 1999.
Greenwald et al., Space-efficient online computation of quantile summaries. In Int'l Conf. Management of Data (SIGMOD), pp. 58-66, Santa Barbara, CA, May 2001.
Guha et al., Clustering data streams, In IEEE Symposium on Foundations of Computer Science (FOCS), pp. 359-366, 2000.
Hulten et al., Mining Time-changing data streams, In Int'l Conf. on Knowledge Discovery and Data Mining (SIGKDD), pp. 97-106, San Francisco, CA, 2001, ACM Press.
Shafer et al., Spring: A scalable parallel classifier for data mining, In Proc. of Very Large Database (VLDB), 1996.
Stolfo et al., Credit card fraud detection using meta-learning: Issues and initial results, In AAAI-97 Workshop on Fraud Detection and Risk Management, 1997.
Street et al., A streaming ensemble algorithm (SEA) for large-scale classification, In Int'l Conf. on Knowledge Discovery and Data Mining (SIGKDD), 2001.
Wang et al., Mining concept-drifting data streams using ensemble classifiers, In Proceedings of ACM SIGKDD International Conf. on Knowledge Discovery and Data Mining (SIGKDD), 2003.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for continuous diagnosis of data streams does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for continuous diagnosis of data streams, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for continuous diagnosis of data streams will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4045554

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.