Patent
1996-07-31
1998-09-22
Black, Thomas G.
Data processing: database and file management or data structures
Database design
Data structure types
707/2, 707/3, 707/6, G06F 17/30
active
058130020
ABSTRACT:
A method for detecting deviations in a database is disclosed, comprising the steps of: determining respective frequencies of occurrence for the attribute values of the data items, and identifying any itemset whose similarity value satisfies a predetermined criterion as a deviation, based on the frequencies of occurrence. The determination of the frequencies of occurrence includes computing an overall similarity value for the database, and for each first itemset, computing a difference between the overall similarity value and the similarity value of a second itemset. The second itemset has all the data items except those of the first itemset. Preferably, a smoothing factor is used for indicating how much dissimilarity in an itemset can be reduced by removing a subset of items from the itemset. The smoothing factor is evaluated as each item is incrementally removed from the itemset, thereby allowing a data item to be identified as a deviation when the difference in similarity value is the highest.
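The scoring step in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It makes several assumptions that are not in the abstract: population variance stands in for the dissimilarity measure, the smoothing factor is taken as the size of the remaining itemset times the reduction in dissimilarity, and only single-item candidates are scored by default. The names `dissimilarity`, `smoothing_factor`, and `find_deviation` are hypothetical.

```python
from statistics import pvariance


def dissimilarity(values):
    """Assumed stand-in measure: population variance (0.0 for fewer than 2 items)."""
    return float(pvariance(values)) if len(values) > 1 else 0.0


def smoothing_factor(values, candidate):
    """Score how much dissimilarity drops when the candidate indices are removed.

    `candidate` plays the role of the "first itemset"; the remaining items
    form the "second itemset" (all data items except those of the first).
    """
    rest = [v for i, v in enumerate(values) if i not in candidate]
    reduction = dissimilarity(values) - dissimilarity(rest)
    # Assumed weighting: scale the reduction by the size of the remaining set.
    return len(rest) * reduction


def find_deviation(values, candidates=None):
    """Return the candidate index set with the highest smoothing factor."""
    if candidates is None:
        # Assumption: score each single item as its own candidate itemset.
        candidates = [frozenset([i]) for i in range(len(values))]
    return max(candidates, key=lambda c: smoothing_factor(values, c))


data = [10, 11, 9, 10, 50, 10]
best = find_deviation(data)
print(sorted(best))  # index of the item flagged as the deviation
```

On this sample data, removing the value 50 reduces the variance far more than removing any other single item, so its index is the one returned. Scoring every candidate against the full set is done here for clarity only; it does not reproduce the incremental evaluation the abstract describes.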
REFERENCES:
patent: 3670310 (1972-06-01), Bharwani et al.
patent: 3694813 (1972-09-01), Loh et al.
patent: 4290115 (1981-09-01), Pitt et al.
patent: 4653021 (1987-03-01), Takagi
patent: 4800511 (1989-01-01), Tanaka
patent: 4839853 (1989-06-01), Deerwester et al.
patent: 4868750 (1989-09-01), Kucera et al.
patent: 4956774 (1990-09-01), Shibamiya et al.
patent: 5031206 (1991-07-01), Riskin
patent: 5129082 (1992-07-01), Tirfing et al.
patent: 5168565 (1992-12-01), Morita
patent: 5255386 (1993-10-01), Prager
patent: 5276629 (1994-01-01), Reynolds
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5351247 (1994-09-01), Dow et al.
patent: 5375235 (1994-12-01), Berry et al.
patent: 5418951 (1995-05-01), Damashek
patent: 5440481 (1995-08-01), Kostoff et al.
patent: 5488725 (1996-01-01), Turtle et al.
patent: 5542089 (1996-07-01), Lindsay et al.
patent: 5544049 (1996-08-01), Henderson et al.
patent: 5544352 (1996-08-01), Egger
patent: 5576954 (1996-11-01), Driscoll
patent: 5598557 (1997-01-01), Doner et al.
patent: 5642502 (1997-06-01), Driscoll
patent: 5659732 (1997-08-01), Kirsch
patent: 5666442 (1997-09-01), Wheeler
patent: 5699509 (1997-12-01), Gary et al.
J. W. Shavlik, T. G. Dietterich, Readings in Machine Learning (book), Morgan Kaufmann Pub. Inc., San Mateo, CA, Chapter 3 Unsupervised Concept Learning and Discovery, pp. 263-283, 1990.
P. F. Velleman, D. C. Hoaglin, Applications, Basics, & Computing of Exploratory Data Analysis (book), Chapter 3, pp. 65-81, Chapter 5, pp. 121-147, and Chapter 6, pp. 159-184, date unknown.
D. Chamberlin, Using the New DB2 IBM's Object-Relational Database System, Chapter 5, Active Data, pp. 323-342. (book), date unknown.
D. C. Hoaglin, F. Mosteller, J. W. Tukey, Understanding Robust and Exploratory Data Analysis, (book) J. Wiley & Sons, Inc., pp. 1-55, date unknown.
L. G. Valiant, A Theory of the Learnable (Artificial Intelligence and Language Processing), Inductive Learning from Preclassified Training Examples, pp. 192-200, date unknown.
D. E. Rumelhart and D. Zipser, Feature Discovery by Competitive Learning, Orig. Pub. in Cognitive Science, 9.1, 1985, pp. 307-325.
J. R. Quinlan, 2.2 Algorithms, Induction of Decision Trees, Orig. Pub. in Machine Learning, 1:81-106, 1986, Kluwer Academic Publishers, Boston, pp. 57-69.
R. S. Michalski, R. E. Stepp, Learning from Observation: Conceptual Clustering, Chapter 11, pp. 331-362, date unknown.
S. J. Hanson, M. Bauer, Conceptual Clustering, Categorization, and Polymorphy, Machine Learning 3: 343-372, 1989, Kluwer Academic Publishers--Manufactured in The Netherlands.
D. H. Fisher, Knowledge Acquisition via Incremental Conceptual Clustering, Originally Published in Machine Learning, 2: 139-172, 1987, Kluwer Academic Publishers, Boston, 3.2.1 pp. 267-283.
D. Angluin, P. Laird, Learning from Noisy Examples, 1988 Kluwer Academic Publishers, Boston, Manufactured in The Netherlands, Machine Learning 2: 343-370, 1988.
D. W. Aha, D. Kibler, M. K. Albert, Instance-Based Learning Algorithms, Machine Learning, 6, 37-66, 1991, Kluwer Academic Publishers, Boston, Manufactured in The Netherlands.
Agrawal Rakesh
Arning Andreas
Black Thomas G.
Coby Frantz
International Business Machines Corporation
Tran Khanh Q.
Method and system for linearly detecting data deviations in a la