Patent
1995-11-13
1998-03-10
Amsbury, Wayne
395605, G06F 1730
Patent
active
057271992
ABSTRACT:
A computer implemented system, two-step method and computer program product which improves the operations of multi-feature extraction and which efficiently develops classification rules from a large training database. Specifically, given a large training set of data tuples, the first phase, called the feature identification phase, identifies features, which have good power in separating data tuples, based on a subset of the training set. A feature that has a good power in correlating data tuples into groups is said to have a good discriminating power. Since the feature identification phase is performed on a subset of the training set, processing costs are minimized. Limiting this phase to the identification of features having good discriminating power ensures that the use of a subset of the training set does not adversely affect the validity of the conclusions drawn therefrom. In the second phase, called the feature combination phase, the identified features are evaluated in combination against the entire training set to determine the final classification rules. The prior identification of features having good discriminating power advantageously minimizes processing costs during this phase (which is run against the entire training set). The use of multiple features advantageously increases the discrimination power beyond that of the individual features.
REFERENCES:
patent: 5442781 (1995-08-01), Yamagata
patent: 5544352 (1996-08-01), Egger
patent: 5546578 (1996-08-01), Takada
patent: 5598557 (1997-01-01), Doner et al.
patent: 5615341 (1997-03-01), Agrawal et al.
Mannila et al, "Improved Methods for Finding Rules", Pub. No. C-1993-65, 20 pages, University of Helsinki, Dec. 31, 1993.
Agrawal et al, "Mining Association Rules betweem Sets of Items in Large Databases",PROC of the ACM SIGMOD Conference on Management of Data, May 31, 1993, pp. 207-216.
Agrawal et al, "Fast Algorithms for Mining Association Rules", Proc. of the 20th VLDB Conference, Santiago, Chile, Dec. 31, 1994.
R. Agrawal et al., "An Interval Classifier for Datatbase Mining Applications", Proceedings of the 18th International Conference on Very Large Data Bases, p. 560-573, Aug.1992.
J.R. Quinlan, "Induction of Decision Trees", Machine Learning, vol. 1, p. 81-106, 1986.
Chen Ming-Syan
Yu Philip Shi-Ling
Amsbury Wayne
International Business Machines - Corporation
Jordan Kevin M.
LandOfFree
Database mining using multi-predicate classifiers does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Database mining using multi-predicate classifiers, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Database mining using multi-predicate classifiers will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-149537