System and method for selection of important attributes

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Details

707/2, 707/3, 707/5, 707/7, G06F 17/30

Patent

active

060263997

ABSTRACT:
A system and method determine how well the various attributes in a record discriminate between different values of a chosen label attribute. An attribute is considered relevant if it discriminates between different values of the label attribute, either alone or in conjunction with other attributes. According to the present invention, a user selects a label attribute from a set of records, each record having a plurality of attributes. Next, the user selects one or more first important attributes that the user considers important. The present invention then generates one or more second important attributes. The second important attributes, together with the user-chosen first important attributes, discriminate well between different values of the label attribute. A measure called "purity" (a number from 0 to 100) indicates how well each attribute discriminates between the different label values. The purity measure allows the attributes to be ranked by importance.
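
The selection procedure described in the abstract can be illustrated with a short Python sketch. The abstract does not give the formula for purity, so the sketch substitutes an assumed stand-in: the fraction of label entropy removed by conditioning on a set of attributes, scaled to the range 0 to 100. The function names `purity` and `select_attributes`, the greedy search, and the toy records are all hypothetical and are not taken from the patent.

```python
from collections import Counter, defaultdict
from math import log2


def entropy(counts):
    """Shannon entropy (in bits) of a collection of counts."""
    counts = list(counts)
    total = sum(counts)
    return -sum(c / total * log2(c / total) for c in counts if c)


def purity(records, attrs, label):
    """Assumed stand-in for purity, from 0 to 100.

    Computed as 100 * (1 - H(label | attrs) / H(label)). This is an
    illustrative definition, not the formula used by the patent.
    """
    base = entropy(Counter(r[label] for r in records).values())
    if base == 0:
        return 100.0  # a single label value is trivially pure
    # Group records by their combined values on attrs and count labels.
    groups = defaultdict(Counter)
    for r in records:
        groups[tuple(r[a] for a in attrs)][r[label]] += 1
    n = len(records)
    cond = sum(sum(g.values()) / n * entropy(g.values())
               for g in groups.values())
    return 100.0 * (1.0 - cond / base)


def select_attributes(records, label, first, k):
    """Greedily add up to k attributes to the user-chosen list `first`.

    Each step picks the candidate that gives the highest purity when
    combined with everything chosen so far (an assumed search strategy).
    Returns a list of (attribute, cumulative purity) pairs.
    """
    chosen = list(first)
    candidates = [a for a in records[0] if a != label and a not in chosen]
    ranking = []
    for _ in range(k):
        if not candidates:
            break
        best = max(candidates,
                   key=lambda a: purity(records, chosen + [a], label))
        candidates.remove(best)
        chosen.append(best)
        ranking.append((best, purity(records, chosen, label)))
    return ranking


# Toy records (hypothetical): the label "y" is the XOR of "a" and "b".
records = [
    {"a": 0, "b": 0, "c": 0, "y": "no"},
    {"a": 0, "b": 1, "c": 1, "y": "yes"},
    {"a": 1, "b": 0, "c": 0, "y": "yes"},
    {"a": 1, "b": 1, "c": 1, "y": "no"},
]

if __name__ == "__main__":
    print(purity(records, ["a"], "y"))
    print(purity(records, ["a", "b"], "y"))
    print(select_attributes(records, "y", ["a"], 1))
```

In the toy data, attribute `a` says nothing about the label on its own but separates the label values completely when combined with `b`, which mirrors the abstract's point that an attribute can be relevant "either alone or in conjunction with other attributes".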
