Method and apparatus for explaining categorization decisions

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C704S004000, C704S009000

Reexamination Certificate

active

07457808

ABSTRACT:
Feature selection is used to determine feature influence for a given categorization decision to identify those features in a categorized document that were important in classifying the document into one or more classes. In one embodiment, model parameters of a categorization model are used to determine the features that contributed to the categorization decision of a document. In another embodiment, the model parameters of the categorization model and the features of the categorized document are used to determine the features that contributed to the categorization decision of a document.

REFERENCES:
patent: 6772120 (2004-08-01), Moreno et al.
patent: 6858007 (2005-02-01), Akselrod et al.
patent: 7130837 (2006-10-01), Tsochantaridis et al.
patent: 7139754 (2006-11-01), Goutte et al.
patent: 2002/0059202 (2002-05-01), Hadzikadic et al.
patent: 2003/0105863 (2003-06-01), Hegli et al.
patent: 2005/0114313 (2005-05-01), Campbell et al.
patent: 2006/0004747 (2006-01-01), Weare
patent: 1679621 (2006-07-01), None
Janez Brank, Marko Grobelnik, Nata{umlaut over (s)}a Milic-Frayling, and Dunja Mladenic, “Feature selection using support vector machines”, Microsoft Research Publication MSR-TR-2002-63, Jun. 2002 (also published in Proc. of the 3rd Int. Conf. on Data Mining Methods and Databases for Engineering, Finance, and Other Fields, Bologna, Italy, Sep. 2002).
Bernard Colin, “Information et analyse des données”, in Pub. Inst. Stat. Univ. Paris. XXXVII(3-4):43-60, 1993.
Dempster, Laird and Rubin, “Maximum likehood from incomplete data via the EM algorithm”, Journal of the Royal Statistical Society, Series B, 39(1), pp. 1-38, 1977.
Pavel B. Dobrokhotov, Cyril Goutte, Anne-Lise Veuthey Aimd Eric Gaussier, “Combining NLP and Probabilistic Categorisation for Document and term Selection for Swiss-Prot Medical Annotation”, in Proceedings of ISMB-O3, Bioinformatics. vol. 19, Suppl 1, pp. 191-194, 2003.
Drucker, Wu, and Vapnik “Support Vector Machines for Spam Categorization”, IEEE Trans. on Neural Networks, 10:5(1048-1054), 1999.
Ted Dunning, “Accurate methods for the statistics of surprise and coincidence”, in Computational Linguistics, 19(1):61-74, 1993.
Eric Gaussier, Cyril Goutte, Kris Popat, and Francine Chen, “A hierarchical model for clustering and categorising documents”, in Fabio Crestani, Mark Girolami,and Cornelis Joost van Rijsbergen, editors, Advances in Information Retrieval Proceedings of the 24th BCS-IRSC European Colloquium on JR Research, vol. 2291 of Lecture Notes in Computer Science, pp. 229-247, Springer, 2002.
Gaussier, Eric, Jean-Michel Renders, Irina Matveeva, Cyril Goutte, Hervé Déjean, “A Geometric view on billingual lexicon extraction from comparable corpora”, 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, Jul. 25-26, 2004.
Cyril Goutte, Pavel Dobrokhotov, Eric Gaussier, Anne-Lise Veuthey, “Corpus-Based vs. Model-Based Selection of Relevant Features”, in Proceedings of CORIA04, Toulouse, France, Mar. 10-12, pp. 75-88, 2004.
Mladenic, D., Brank, J., Grobelnik, M., and Milic-Frayling, N, “Feature Selection using Linear Classifier Weights: Interaction with Classification Models”, SIGIR 2004, pp. 234-241, Jul. 25-29, 2004.
Sahami, M., Dumais, S., Heckerman, D., and Horvitz, E. “A bayesian approach to filtering junk e-mail”, in Learning for Text Catergorization: Papers from the 1998 AAAI Workshop, 1998.
Yinming Yang and Jan O. Pedersen, “A comparative study on feature selection in text categorization”, in Proceedings of ICML-97, 14th International Conference on Machine Learning, pp. 412-420, 1997.
U.S. Appl. No. 10/774,966, entitled “Method for Multi-Class, Multi-Label Categorization Using Probabilistic Hierarchical Modeling”.
U.S. Appl. No. 10/976,847, entitled “Method and Apparatus for Identifying Bilingual Lexicons in Comparable Corpora”.
European Search Report for EPO counterpart Application No. EP 05 25 7572, Mar. 20, 2006.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for explaining categorization decisions does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for explaining categorization decisions, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for explaining categorization decisions will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4046850

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.