Using data mining algorithms including association rules and...

Data processing: artificial intelligence – Knowledge processing system – Knowledge representation and reasoning technique

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C706S059000, C707S776000, C707S797000, C707SE17044

Reexamination Certificate

active

07836004

ABSTRACT:
Provided are a method, system, and article of manufacture for using a data mining algorithm to discover data rules. A data set including multiple records is processed to generate data rules for the data set. Each record has a record format including a plurality of fields and each rule provides a predicted condition for one field based on at least one predictor condition in at least one other field. The generated data rules are provided to a user interface to enable a user to edit the generated data rules. The data rules are stored in a rule repository to be available to use to validate data sets having the record format.

REFERENCES:
patent: 5615341 (1997-03-01), Agrawal et al.
patent: 5794209 (1998-08-01), Agrawal et al.
patent: 5813002 (1998-09-01), Agrawal et al.
patent: 6078918 (2000-06-01), Allen et al.
patent: 6182070 (2001-01-01), Megiddo et al.
patent: 6272478 (2001-08-01), Obata et al.
patent: 6278997 (2001-08-01), Agrawal et al.
patent: 6836773 (2004-12-01), Tamayo et al.
patent: 6850947 (2005-02-01), Chung et al.
patent: 6877012 (2005-04-01), Ashida et al.
patent: 6941303 (2005-09-01), Perrizo
patent: 6954756 (2005-10-01), Arning et al.
patent: 6965888 (2005-11-01), Cesare et al.
patent: 7266537 (2007-09-01), Jacobsen et al.
patent: 2003/0191667 (2003-10-01), Fitzgerald et al.
patent: 2004/0093559 (2004-05-01), Amaru et al.
patent: 2004/0189708 (2004-09-01), Larcheveque et al.
patent: 2004/0226002 (2004-11-01), Larcheveque et al.
patent: 2005/0060313 (2005-03-01), Naimat et al.
patent: 2005/0066240 (2005-03-01), Sykes et al.
patent: 2005/0066263 (2005-03-01), Baugher
patent: 2005/0108631 (2005-05-01), Amorin et al.
patent: 2005/0144552 (2005-06-01), Kalthoff et al.
patent: 2005/0182739 (2005-08-01), Dasu et al.
patent: 2006/0136461 (2006-06-01), Lee et al.
patent: 2006/0167579 (2006-07-01), Fujii et al.
patent: 2006/0253435 (2006-11-01), Nishizawa et al.
patent: 2006/0274760 (2006-12-01), Loher
patent: 2007/0073688 (2007-03-01), Fry
patent: 1435781 (2003-08-01), None
patent: 1145901 (2004-04-01), None
Janta-Polczynski, M. and E. Roventa, “Fuzzy Measures for Data Quality”, 18th International Conference of the North American Fuzzy Information Processing Society, Jul. 1999, pp. 398-402.
Marchetti, C., M. Mecella, M. Scannapieco, and A. Virgillito, “Enabling Data Quality Notification in Cooperative Information Systems through a Web-Service Based Architecture”, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003, 4pp.
Morgan, Reish, Stone, Swearingen, “Implementation of Comprehensive Qualification and Validation of Entry Fields”, TDB vol. 38, No. 2, Feb. 1995, pp. 317-318.
Seekamp, C. and K. Britton, “Dynamic Generation of Rules from Properties to Improve Rule Processing Performance”, RD No. 429, Jan. 2000, pp. 172.
Shipway and Tricker, “Data Validation and Correction by Context”, TDB Sep. 1971, pp. 1132-1137.
U.S. Appl. No. 11/779,251, filed Jul. 17, 2007, entitled “Managing Validation Models and Rules to Apply to Data Sets”, invented by Labrie, J.J., G. Agrawal, M.A. Roth, & Y. Saillet, 34 pp.
U.S. Appl. No. 11/769,639, filed Jun. 27, 2007, entitled “Using a Data Mining Algorithm to Generate Format Rules Used to Validate Data Sets”, invented by Labrie, J.J., D. Meeks, M.A. Roth, & Y. Saillet, 30 pp.
U.S. Appl. No. 11/769,634, filed Jun. 27, 2007, entitled “Using a Data Mining Algorithm to Generate Rules Used to Validate a Selected Region of a Predicted Column”, invented by Roth, M.A. & Y. Saillet, 42 pp.
Wang, R.Y., H.B. Kon, and S.E. Madnick, “Data Quality Requirements Analysis and Modeling”, Proceedings of the Ninth International Conference on Data Engineering, 1999, pp. 670-677.
Wikipedia, “N-gram”, [online], [retrieved on May 13, 2007]. Retrieved from the Internet at <URL: http://en.wikipedia.org/wiki/N-gram>, 3 pp.
Data Mining Group, “Association Rules”, PMML 3.1, [online], [retrieved on Nov. 1, 2006], retrieved from the Internet at <URL: http://www.dmg.org/v3-1/AssociationRules.html>, 7 pp.
Data Mining Group, “Trees”, PMML 3.1, [online], [retrieved on Nov. 1, 2006], retrieved from the Internet at <URL: http://www.dmg.org/v3-1/TreeModel.html>, 18 pp.
Han, E., G. Karypis, & V. Kumar, “Scalable Parallel Data Mining for Association Rules”, Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, 1997, pp. 277-288.
Hipp, J., U. Guntzer, & U. Grimmer, “Data Quality Mining- Making A Virtue of Necessity”, Proceedings of the 6th ACM SIGMOND Workshop on Research Issues in Data Mining and Knowledge Discovery, 2001, pp. 52-57.
Korn, F., A. Labrinidis, Y. Kotidis, & C. Faloutsos, “Quantifiable Data Mining Using Ratio Rules”, The VLDB Journal, 2000, pp. 254-256.
Marcus, A., J.I. Maletic, & K. Lin, “Ordinal Association Rules for Error Identification in Data Sets”, Proceedings of the Tenth International Conference on Information and Knowledge Management, 2001, pp. 589-591.
Muller, H., U. Leser, & J. Freytag, “Mining for Patterns in Contradictory Data”, Proceedings of the 2004 International Workshop on Information Quality in Information Systems, 2004, pp. 51-58.
Pudi, V., “Data Mining- Association Rules”, [online], [retrieved on Nov. 1, 2006], retrieved from the Internet at <URL: http://www.iiit.ac.in/˜vikram/mining.html>, 3 pp.
Wikipedia, “Decision Tree”, [online], [retrieved on Nov. 1, 2006], retrieved from the Internet at <URL: http://en.wikipedia.org/w/index.php?title=Decision—tree&printable=yes>, 7 pp.
Knobbe, A.J., “Multi-Relational Data Mining”, Nov. 22, 2004, 130 pp.
Shekhar, S., B. Hamidzadeh, A. Kohli, & M. Coyle, “Learning Transformation Rules for Semantic Query Optimization: A Data-Driven Approach”, IEEE Transactions on Knowledge and Data Engineering, Vol. 5, Iss. 6, Dec. 1993, pp. 1-29.
U.S. Appl. No. 12/165,549, filed Jun. 30, 2008, entitled “Discovering Transformations Applied to a Source Table to Generate a Target Table”, invented by Bittner, T., H. Kache, M.A. Roth, and Y. Saillet, 49 pp.
Wikipedia, “Apriori Algorithm”, [online], Updated May 22, 2006, [retrieved on Jun. 20, 2008], retrieved from the Internet at <URL: http://en.wikipedia.org/w/index.php?title=apriori—algorithm&printible=yes>, 3 pp.
Williams, J., “Tools for Traveling Data”, [online], Jun. 1997, [retrieved on Mar. 25, 2008], retrieved from the Internet at <URL: http://www.dbmsmag.com/9706d16.html>, 10 pp.
English Abstract for CN1435781A, published Aug. 13, 2003, 1 p.
English Abstract for CN1145901C, published Apr. 14, 2004, 1 p.
Jingyi, D., “Survey of Association Rule Data Mining”, © 1994-2009 China Academic Journal Electronic Publishing House, Total 2 pp. [with English Abstract on page 1].
Nesvizhskii, A.I., F.F. Roos, J. Grossman, M. Vogelzang, J.S. Eddes, W. Gruissem, S. Baginsky, and R. Aebersold, “Dynamic Spectrum Quality Assessment and Iterative Computational Analysis of Shotgun Protemic Data”, Molecular & Cellular Proteomics, vol. 5, © 2006, The American Society for Biochemistry and Molecular Biology, Inc., pp. 652-670.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Using data mining algorithms including association rules and... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Using data mining algorithms including association rules and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Using data mining algorithms including association rules and... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4179066

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.