Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-02-14
1999-05-04
Black, Thomas G.
Data processing: database and file management or data structures
Database design
Data structure types
707 1, 707 5, G06F 1730
Patent
active
058999922
ABSTRACT:
A method, apparatus, and article of manufacture for a computer implemented scaleable set-oriented classifier. The scalable set-oriented classifier stores set-oriented data as a table in a relational database. The table is comprised of rows having attributes. The scalable set-oriented classifier classifies the rows by building a classification tree. The scalable set-oriented classifier determines a gini index value for each split value of each attribute for each node that can be partitioned in the classification tree. The scalable set-oriented classifier selects an attribute and a split value for each node that can be partitioned based on the determined gini index value corresponding to the split value. Then, the scalable set-oriented classifier grows the classification tree by another level based on the selected attribute and split value for each node. The scalable set-oriented classifier repeats this process until each row of the table has been classified in the classification tree.
REFERENCES:
Ganski, R. A., et al., "Optimization of Nested SQL Queries Revisited", Proceedings of Association for Computing Machinery Special Interest Group on Management of Data, May 27-29, 1987, pp. 23-33.
Muralikrishna, M., "Improved Unnesting Algorithms for Join Aggregate SQL Queries", Proceedings of the 18th International Conference on Very Large Data Bases, Aug. 23-27, 1992, pp. 91-102.
Quinlan, J.R., "Combining Instance-Based and Model-Based Learning", Machine Learning, Proceedings of the Tenth International Conference, Jun. 27-29, 1993, pp. 236-243.
Agrawal, R., "Database Mining: A Performance Perspective", IEEE Transactions on Knowledge and Data Engineering, vol. 5, No. 6, Dec. 1993, pp. 914-925.
Baru, C.K., et al., "DB2 Parallel Edition", IBM Systems Journal , vol. 34, No. 2, 1995, pp. 292-322.
Houtsma, M., et al., "Set-Oriented Mining for Association Rules in Relational Databases", Eleventh International Conference on Data Engineering, Mar. 6-10, 1995, pp. 25-33.
Bhargava, G., et al., "Hypergraph based reorderings of outer join queries with complex predicates", Proceedings of the 1995 ACM SIGMOD, International Conference on Management of Data, May 23-25, 1995, pp. 304-315.
Sarawagi, S., "Query Processing in Tertiary Memory Databases", Proceedings of the 21st International Conference on Very Large Data Bases, Sep. 11-15, 1995, pp. 585-596.
Sarawagi, S., et al., "Reordering Query Execution in Tertiary Memory Databases", Proceedings of the 22nd VLDB Conference, 1996, pp. 156-167.
Shafer, J., et al., "SPRINT: A Scalable Parallel Classifier for Data Mining", Proceedings of the 22nd VLDB Conference, 1996, pp. 544-555.
Mehta, M., et al., "SLIQ: A Fast Scalable Classifier for Data Mining", 5th International Conference on Extending Database Technology, Mar. 25-29, 1996, pp. 18-32.
Nguyen, T., et al., "Accessing Relational Databases from the World Wide Web", Proceedings ACM SIGMOD International Conference on Management of Data, Jun. 4-6, 1996, pp. 529-540.
Iyer Balakrishna Raghavendra
Wang Min
Black Thomas G.
Coby Frantz
International Business Machines - Corporation
LandOfFree
Scalable set oriented classifier does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Scalable set oriented classifier, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Scalable set oriented classifier will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1867216