Using object relational extensions for mining association rules

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06301575

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates in general to computer implemented data mining, and in particular to using object relational extensions for mining association rules.
2. Description of Related Art
There has been a rapid growth in the automation of data collection procedures in the last decade. This has led to a vast growth in the amount of usable data. Translating this usable data to useful information requires the use of a variety of data mining and knowledge extraction techniques. Accompanying these developments has been the growth of reliable, highly optimized relational database systems. As more and more data stores begin to rely on these database systems, the integration of the mining techniques with the database systems becomes desirable. However, efficient utilization of database systems as mining engines requires some modifications to the relational database system and to data organization.
Data mining is the process of finding interesting patterns in data. Data mining retrieves interesting data from a very large database, such as a database describing existing, past, or potential clients that may have thousands of attributes. A database is a set of records that are described by a set of attributes which have values.
Conventional data mining techniques do not work well on a database with a large number of attributes. In particular, most conventional data mining techniques only work one data in memory. Therefore, if the data is so large that it must be stored other than in memory, the data mining techniques will move data into memory to operate on the data, which is inefficient both in terms of memory usage and time.
The successful automation of data collection and the growth in the importance of information repositories have given rise to numerous data stores, ranging from those of large scientific organizations, banks and insurance companies, to those of small stores and businesses. The abundance of data has required the use of innovative and intricate data warehousing and data mining techniques to summarize and make use of the data.
There has been significant activity in developing new techniques for knowledge extraction, which is described further in G. Piatetsky-Shapiro and W. J. Frawley,
Knowledge Discovery in Databases
, AAAI/MIT Press, 1991, which is incorporated by reference herein. Some of the new techniques are for classification of data, which is further described in S. M. Weiss and C. A. Kulikowski,
Computer Systems that Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning and Expert Systems
, Morgan-Kauftnann, 1991; R. Agrawal, S. Ghosh, T. Imielinski, B. Iyer, and A. Swami,
An Interval Classifier For Database Mining Applications
, Proceedings of the 18th International Conference on Very Large Databases, pages 560-573, August 1992; which are incorporated by reference herein.
Some of the new techniques for knowledge extraction are for clustering of data. T. Zhang, R. Ramakrishnan, and M. Livny, Birch,
An Efficient Data Clustering Method For Very Large Databases
, Proceedings of the 1996 ACM SIGMOD International Conference of Management of Data, 1996; R. T. Ng and J. Han,
Efficient And Effective Clustering Methods For Spatial Data Mining
, Proceedings of the 20th International Conference on Very Large Databases, 1994; A. K. Jain and R. C. Dubes,
Techniques For Clustering Data
, Prentice-Hall, 1988; L. Kaufman and P. J. Rousseeuw,
Finding Groups In Data—An Introduction To Cluster Analysis
, Wiley, 1990, which are incorporated by reference herein.
Some of the techniques for knowledge extraction are for discovery of association rules, and association rules are derived from and used to represent frequently occurring patterns within the database. R. Agrawal., T. Imielinski, and A. Swami,
Mining Association Rules Between Sets Of Items In Large Databases
, Proceedings of SIGMOD '93, pages 207-216, May 1993; R. Agrawal and R. Srikant,
Fast Techniques For Mining Association Rules
, Proceedings of the 20th International Conference on Very Large Databases, September 1994, [hereinafter “Fast Techniques For Mining Association Rules”]; M. Houtsma and A. Swami,
Set
-
Oriented Mining Of Association Rules
, Technical Report RJ 9567, IBM Almaden Research Center, October 1993, [hereinafter “Set-Oriented Mining of Association Rules”]; J. S. Park, M. S. Shen, and P. S. Yu,
An Effective Hash Based Technique For Mining Association Rules
, Proceedings of SIGMOD '95, May 1995; R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. I. Verkamo,
Fast Discovery Of Association Rules
, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, edited by U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, 1995; H. Toivonen,
Sampling Large Databases For Association Rules
, Proceedings of the 22nd International Conference on Very Large Databases; A. Savasere, E. Omiecinski, and S. Navathe,
An Efficient Technique For Mining Association Rules In Large Databases
, Proceedings of the 21 st International Conference on Very Large Databases, September 1995; H. Mannila, H. Toivonen, and A. I. Verkamo,
Efficient Techniques For Discovering Association Rules
, Technical Report WS-94-03, American Association for Artificial Intelligence, 1994; R. Srikant and R. Agrawal,
Mining Generalized Association Rules
, Proceedings of the 21 st International Conference on Very Large Databases, September 1995; J. Han and Fu,
Discovery Of Multiple-Level Association Rules From Large Databases
, Proceedings of the 21st International Conference on Very Large Databases, September 1995; J. Han, Y. Cai, and N. Cercone,
Data—Driven Discovery Of Quantitative Rules In Relational Databases
, IEEE Transactions on Knowledge and Data Engineering, Vol. 5(1), pages 29-40, 1993; R. Srikant and R. Agrawal,
Mining Ouantitative Association Rules In Large Relational Tables
, Proceedings of the 1996 ACM SIGMOD International Conference of Management of Data, 1996; T. Fukuda, Y. Morimoto, S. Morishita, and T. Tokuyama,
Mining Optimized Association Rules For Numeric Attributes
, Proceedings of the 1996 ACM Symposium on Principles of Database Systems, 1996; R. Miller and Y. Yang,
Association Rules Over Interval Data
, Proceedings of SIGMOD '97, 1997; which are incorporated by reference herein.
Some of the techniques for knowledge extraction are for sequential patterns. R. Agrawal and R. Srikant,
Mining Sequential Patterns
, Proceedings of the 11th International Conference on Data Engineering, March 1995, which is incorporated by reference herein. Some of the techniques for knowledge extraction are for similarities in ordered data. R. Agrawal, C. Faloutsos, and A. Swami,
Efficient Similarity Search In Sequence Databases
, 4th International Conference on Foundations of Data Organization and Techniques, October 1993; C. Faloutsos, M. Ranganathan, and Y. Manolopoulos,
Fast Subsequence Matching In Time-Series Databases
, Proceedings of SIGMOD '94, May 1994; R. Agrawal, K. I. Lin, H. S. Sawhney, and K. Shim,
Fast Similarity Search In The Presence Of Noise, Scaling And Translation In Time
-
Series Databases
, Proceedings of the 21st International Conference on Very Large Databases, September 1995; R. Agrawal, G. Psaila, E. L. Wimmers, and M. Zait,
Querying Shapes Of Histories
, Proceedings of the 21st International Conference on Very Large Databases, September 1995; which are incorporated by reference herein.
Houtsma and Swami, in “Set-Oriented Mining of Association Rules”, had proposed SETM, an SQL based technique for association. Their technique uses simple database operations (e.g., sorting and merge-scan joins) for performing association. However, their joins are more expensive as they are against the input data table and they do not have an efficient candidate set pruning such as the Apriori technique.
The size and growth of the data stores, matched by the growing reliability and large-volume handling capability of relational database systems, has caused much of the data to be

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Using object relational extensions for mining association rules does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Using object relational extensions for mining association rules, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Using object relational extensions for mining association rules will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2583286

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.