Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2004-12-17
2008-11-25
Truong, Cam Y (Department: 2163)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000, C704S004000, C704S009000
Reexamination Certificate
active
07457808
ABSTRACT:
Feature selection is used to determine feature influence for a given categorization decision to identify those features in a categorized document that were important in classifying the document into one or more classes. In one embodiment, model parameters of a categorization model are used to determine the features that contributed to the categorization decision of a document. In another embodiment, the model parameters of the categorization model and the features of the categorized document are used to determine the features that contributed to the categorization decision of a document.
REFERENCES:
patent: 6772120 (2004-08-01), Moreno et al.
patent: 6858007 (2005-02-01), Akselrod et al.
patent: 7130837 (2006-10-01), Tsochantaridis et al.
patent: 7139754 (2006-11-01), Goutte et al.
patent: 2002/0059202 (2002-05-01), Hadzikadic et al.
patent: 2003/0105863 (2003-06-01), Hegli et al.
patent: 2005/0114313 (2005-05-01), Campbell et al.
patent: 2006/0004747 (2006-01-01), Weare
patent: 1679621 (2006-07-01), None
Janez Brank, Marko Grobelnik, Nata{umlaut over (s)}a Milic-Frayling, and Dunja Mladenic, “Feature selection using support vector machines”, Microsoft Research Publication MSR-TR-2002-63, Jun. 2002 (also published in Proc. of the 3rd Int. Conf. on Data Mining Methods and Databases for Engineering, Finance, and Other Fields, Bologna, Italy, Sep. 2002).
Bernard Colin, “Information et analyse des données”, in Pub. Inst. Stat. Univ. Paris. XXXVII(3-4):43-60, 1993.
Dempster, Laird and Rubin, “Maximum likehood from incomplete data via the EM algorithm”, Journal of the Royal Statistical Society, Series B, 39(1), pp. 1-38, 1977.
Pavel B. Dobrokhotov, Cyril Goutte, Anne-Lise Veuthey Aimd Eric Gaussier, “Combining NLP and Probabilistic Categorisation for Document and term Selection for Swiss-Prot Medical Annotation”, in Proceedings of ISMB-O3, Bioinformatics. vol. 19, Suppl 1, pp. 191-194, 2003.
Drucker, Wu, and Vapnik “Support Vector Machines for Spam Categorization”, IEEE Trans. on Neural Networks, 10:5(1048-1054), 1999.
Ted Dunning, “Accurate methods for the statistics of surprise and coincidence”, in Computational Linguistics, 19(1):61-74, 1993.
Eric Gaussier, Cyril Goutte, Kris Popat, and Francine Chen, “A hierarchical model for clustering and categorising documents”, in Fabio Crestani, Mark Girolami,and Cornelis Joost van Rijsbergen, editors, Advances in Information Retrieval Proceedings of the 24th BCS-IRSC European Colloquium on JR Research, vol. 2291 of Lecture Notes in Computer Science, pp. 229-247, Springer, 2002.
Gaussier, Eric, Jean-Michel Renders, Irina Matveeva, Cyril Goutte, Hervé Déjean, “A Geometric view on billingual lexicon extraction from comparable corpora”, 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, Jul. 25-26, 2004.
Cyril Goutte, Pavel Dobrokhotov, Eric Gaussier, Anne-Lise Veuthey, “Corpus-Based vs. Model-Based Selection of Relevant Features”, in Proceedings of CORIA04, Toulouse, France, Mar. 10-12, pp. 75-88, 2004.
Mladenic, D., Brank, J., Grobelnik, M., and Milic-Frayling, N, “Feature Selection using Linear Classifier Weights: Interaction with Classification Models”, SIGIR 2004, pp. 234-241, Jul. 25-29, 2004.
Sahami, M., Dumais, S., Heckerman, D., and Horvitz, E. “A bayesian approach to filtering junk e-mail”, in Learning for Text Catergorization: Papers from the 1998 AAAI Workshop, 1998.
Yinming Yang and Jan O. Pedersen, “A comparative study on feature selection in text categorization”, in Proceedings of ICML-97, 14th International Conference on Machine Learning, pp. 412-420, 1997.
U.S. Appl. No. 10/774,966, entitled “Method for Multi-Class, Multi-Label Categorization Using Probabilistic Hierarchical Modeling”.
U.S. Appl. No. 10/976,847, entitled “Method and Apparatus for Identifying Bilingual Lexicons in Comparable Corpora”.
European Search Report for EPO counterpart Application No. EP 05 25 7572, Mar. 20, 2006.
Gaussier Eric
Goutte Cyril
Fay Sharpe LLP
Hwa Shyue Jiunn
Truong Cam Y
Xerox Corporation
LandOfFree
Method and apparatus for explaining categorization decisions does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for explaining categorization decisions, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for explaining categorization decisions will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4046850