Data processing: database and file management or data structures – Database design – Database and data structure management
Reexamination Certificate
2011-08-09
2011-08-09
Timblin, Robert (Department: 2167)
Data processing: database and file management or data structures
Database design
Database and data structure management
C704S275000
Reexamination Certificate
active
07996440
ABSTRACT:
One or more classification algorithms are applied to at least one natural language document in order to extract both attributes and values of a given product. Supervised classification algorithms, semi-supervised classification algorithms, unsupervised classification algorithms or combinations of such classification algorithms may be employed for this purpose. The at least one natural language document may be obtained via a public communication network. Two or more attributes (or two or more values) thus identified may be merged to form one or more attribute phrases or value phrases. Once attributes and values have been extracted in this manner, association or linking operations may be performed to establish attribute-value pairs that are descriptive of the product. In a presently preferred embodiment, an (unsupervised) algorithm is used to generate seed attributes and values which can then support a supervised or semi-supervised classification algorithm.
REFERENCES:
patent: 5146406 (1992-09-01), Jensen
patent: 5943670 (1999-08-01), Prager
patent: 6405175 (2002-06-01), Ng
patent: 6668254 (2003-12-01), Matson
patent: 6714939 (2004-03-01), Saldanha
patent: 6785671 (2004-08-01), Bailey
patent: 6986104 (2006-01-01), Green et al.
patent: 7020662 (2006-03-01), Boreham et al.
patent: 7043420 (2006-05-01), Ratnaparkhi
patent: 7043492 (2006-05-01), Neal et al.
patent: 7567976 (2009-07-01), Betz et al.
patent: 2002/0107861 (2002-08-01), Clendinning et al.
patent: 2002/0111863 (2002-08-01), Landesman
patent: 2002/0133479 (2002-09-01), Dippold
patent: 2002/0142277 (2002-10-01), Burstein et al.
Ester, (A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, AAI, 1996, pp. 226-231).
Naumann (Declarative Data Mining with Conflict Resolution, IBM, 2002, pp. 1-13).
Ghani (Mining the Web to Add Semantics to Retail Data Mining, Springer, 2004, pp. 43-56).
Guha, (A Clustering Algorithm for Categorical Attributes, Elsevier, 2000, pp. 1-24).
Collins (Unsupervised Models for Named Entity Classification, 1999, pp. 100-110).
Songfeng (Unsupervised Clustering Based Reduced Support Vector Machines, IEE, 2003, pp. 821-824).
A.P. Dempster et al., “Maximum Likelihood from Incomplete Data via the EM Algorithm”, Journal of the Royal Statistical Society, Series B, 39 (1):1-38, 1977.
R. Agrawal, et al., “Fast Discovery of Association Rules”, in U. Fayyad et al, editors, Advances in Knowledge Discovery and Data Mining, AAAI Press/The MIT Press, pp. 307-328, 1996.
T. Jaakkolda et al., “Exploiting generative models in discriminative classifiers”, In Advances in NIPS 11, 1999.
D.D. Lewis, “Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval”, in Machine Learning: ECML-98, Tenth European Conference on Machine Learning, pp. 4-15, 1998.
K. Seymour, et al., “Learning Hidden Markov Model Structure for Information Extraction”, in Machine Learning for Information Extraction Papers from the AAAI Workshop, 1999. Tech Rep. WS-99-11, AAAI Press.
U.Y. Nahm, et al., “Text Mining with Information Extraction”, in AAAI 2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases, 2002.
A. McCallum & K. Nigam, A Comparison of Event Models for Naive Bayes Text Classification, in Learning for Text Categorization: Papers from the AAAI Workshop, pp. 41-48, 1998. Tech. Rep. WS-98-05, AAAI Press.
R. Ghani et al, Data Mining on Symbolic Knowledge Extraction from the Web, in Workshop of Text Mining at the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2000.
T. Joachims, “Transductive Inference for Text Classification Using Support Vector Machines.” In Machine Learning; Proceedings of the Sixteenth International Conference, 1999.
M. Craven, et al., “Learning to Construct Knowledge Bases from the World Wide Web”, Artificial Intelligence, 118 (1-2):69-114, 2000.
A. Blum & T. Mitchell, “Combining Labeled and Unlabeled Data with Co-Training”, Proceedings of the 1998 Conference on Computational Learning Theory.
E. Brill, “Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in the Part-of-Speech Tagging”, Association for Computational Linguistics, 1995.
M. Collins & Y. Singer, “Unsupervised Models for Named Entity Classification”, AT&T Labs—Research.
J. Ben Schafer, et al., “E-Commerce Recommendation Applications”, GroupLens Research Project, Department of Computer Science and Engineering, University of Minnesota.
R. Ghani & A. Fano, “Building Recommender Systems using a Knowledge Base of Product Semantics”, Accenture Technology Labs, Chicago, Illinois.
R. Ghani & R. Jones, “A Comparison of Efficacy and Assumptions of Bootstrapping Algorithms for Training Information Extraction Systems”, Accenture Technology Labs, Chicago IL, School of Computer Science, Carnegie Mellow University, Pittsburgh PA.
D. Lin, “Dependency-Based Evaluation of Minipar”, Department of Computer Science, University of Alberta, Edmonton, Alberta, Canada.
R. Jones, “Learning to Extract Entities from Labeled and Unlabeled Text,” School of Computer Science, Carnegie Mellon University, May 2005.
B. Liu, et al., “Opinion Observer: Analyzing and Comparing Opinions on the Web”, International World Wide Web Conference Committee (IW3C2), WWW2005, May 10-14, 2005, Chiba, Japan.
K. Nigam & R. Ghani, “Analyzing the Effectiveness and Applicability of Co-Training”, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA.
F. Peng, A. McCallum, “Accurate Information Extraction from Research Papers using Conditional Random Fields”, Department of Computer Science, University of Massachusetts, Amherst MA.
A.M. Popescu & O. Etzioni, “Extracting Product Features and Opinions from Reviews”, Department of Computer Science and Engineering, University of Washington, Seattle, WA.
M.F. Porter, “An Algorithm for Suffix Stripping”, PROGRAM, 14 No. 3, pp. 130-137, Jul. 1980.
“Text Classification from Labeled and Unlabeled Documents using EM”, Nigam, et al., Machine Learning 1-34, Kluwer Academic Publishers, Boston, Manufactured in the Netherlands.
Hoiem, et al., “Object-Based Image Retrieval Using the Statistical Structure of Images”, IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2004.
DeCoste, et al., “Distortion-Invariant Recognition via Jittered Queries”, IEEE, pp. 1-6, 2002.
Zhou, Y., “Democratic Co-Learning”, 16th IEEE Conference on Tools with Artificial Intelligence, pp. 1-9, 2004.
Zufiria, et al, “Extended Backpropagation for Invariant Pattern Recognition Neural Networks”, International Joint Conference on Neural Networks, pp. 2097-2100, 1993.
Licsar, et al., “User-Adaptive Hand Gesture Recognition System with Interactive Training”, Image and Vision Computing 23, pp. 1102-1114, 2005.
Becker, D., “Sensei: A Real-Time Recognition, Feedback and Training System for T'ai Chi Gestures”, MIT Media Lab Perceptual Computing Gruop Technical Report No. 426, pp. 1-50, 1997.
Probst, K., R. Ghani, “Towards ‘Interactive’ Active Learning in Multi-view feature sets for Information Extraction”, Lecture Notes in Computer Science, vol. 4701, Sep. 8, 2007, pp. 683-690, XP019071429.
Probst, et al., “Semi-Supervised Learning of Attribute Value Pairs from Product Descriptions” Proceedings of the 20th International Joint Conferences on Artificial Intelligence (IJCAI '07) Jan. 6, 2007-Jan. 12, 2007, pp. 2838-2843, XP007906331.
Probst, et al., “Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web”, Proceedings of the 2006 Workshop on Web Mining (Webmine '06); 17th European Conference on Machine Learning (ECML '06) &
Fano Andrew E.
Ghani Rayid
Krema Marko
Liu Yan
Probst Katharina
Accenture Global Services Limited
Arjomandi Noosha
Timblin Robert
Vedder Price PC
LandOfFree
Extraction of attributes and values from natural language... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Extraction of attributes and values from natural language..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Extraction of attributes and values from natural language... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2697887