Data processing: database and file management or data structures – Database and file access – Preparing data for information retrieval
Reexamination Certificate
2006-08-15
2010-10-05
Ali, Mohammad (Department: 2158)
Data processing: database and file management or data structures
Database and file access
Preparing data for information retrieval
Reexamination Certificate
active
07809723
ABSTRACT:
A method and system for distributed training of a hierarchical classifier for classifying documents using a classification hierarchy is provided. A training system provides training data that includes the documents and classifications of the documents within the classification hierarchy. The training system distributes the training of the classifiers of the hierarchical classifier to various agents so that the classifiers can be trained in parallel. For each classifier, the training system identifies an agent that is to train the classifier. Each agent then trains its classifiers.
REFERENCES:
patent: 5428778 (1995-06-01), Brookes et al.
patent: 5675710 (1997-10-01), Lewis
patent: 5794236 (1998-08-01), Mehrle
patent: 5832470 (1998-11-01), Morita et al.
patent: 5870735 (1999-02-01), Agrawal et al.
patent: 6233575 (2001-05-01), Agrawal et al.
patent: 6507829 (2003-01-01), Richards et al.
patent: 6553365 (2003-04-01), Summerlin et al.
patent: 6556987 (2003-04-01), Brown et al.
patent: 6826576 (2004-11-01), Lulich et al.
patent: 2002/0083039 (2002-06-01), Ferrari et al.
patent: 2004/0111453 (2004-06-01), Harris et al.
patent: 2008/0177680 (2008-07-01), Laxman et al.
patent: 2008/0177684 (2008-07-01), Laxman et al.
U.S. Appl. No. 11/625,249, Laxman.
U.S. Appl. No. 11/625,266, Laxman.
Akbani, Rehan et al., “Applying Support Vector Machines to Imbalanced Datasets,” ECML 2004, LNAI 3201, ©Springer-Verlag Berlin Heidelberg 2004, pp. 39-50.
Bottou, Leon et al., “Comparison of Classifier Methods: A Case Study in Handwritten Digit Recognition,” JCPR, Oct. 1994, 11 pages.
Boyapati, Vijay, “Improving Hierarchical Text Classification Using Unlabeled Data,” SIGIR'02 Tampere, Finland, ACM, pp. 363-364.
Bredensteiner, Erin J. and Kristin P. Bennett, “Multicategory Classification by Support Vector Machines,” Computer Optimization and Applications, 1999, 30 pages.
CAI, Lijuan and Thomas Hofmann, “Hierarchical Document Categorization with Support Vector Machines,” CIKM'04, Washington, D.C., ©2004 ACM, pp. 78-87.
Chen, Hao and Susan Dumais, “Bringing Order to the Web: Automatically Categorizing Search Results,” Proceedings of CHI'00, Human Factors in Computing Systems, 2000, pp. 145-152.
Dumais, Susan and Hao Chen, “Hierarchical Classification of Web Content,” SIGIR 2000, Athens, Greece, ©2000 ACM, pp. 256-263.
Dunning, Ted, “Accurate Methods for the Statistics of Surprise and Coincidence,” Computational Linguistics, vol. 19, No. 1, 1993, ©1993 Association for Computational Linguistics, pp. 61-74.
Forman, George, “An Extensive Empirical Study of Feature Selection Metrics for Text Classification,” Journal of Machine Learning Research 3, 2003, ©2003 Hewlett-Packard, pp. 1289-1305.
Ghani, Rayid, “Using Error-Correcting Codes for Text Classification,” ICML, 2000, pp. 303-310.
Granitzer, Michael, “Hierarchical Text Classification using Methods from Machine Learning,” Oct. 27, 2003, Master's Thesis at Graz University of Technology, 104 pages.
Hersh, William et al., “OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research,” SIGIR 1994, pp. 192-201.
Hsu, Chih-Wei and Chih-Jen Lin, “A Comparison of Methods for Multi-class Support Vector Machines,” Technical Report, Department of Computer Science and Information Engineering, National Taiwan University, 2001, 26 pages.
Joachims, Thorsten, “Making Large-Scale SVM Learning Practical,” LS-8 Report 24, Jun. 15, 1998, University of Dortmund, Computer Science Department, 17 pages.
Lewis, David D. at al., “RCV1: A New Benchmark Collection for Text Categorization Research,” Journal of Machine Learning Research, 5, 2004, pp. 361-397.
Lewis, David D., “Reuters-21578,” Test Collections, 1 page, http://www.daviddlewis.com/resources/testcollections/reuters21578.
Platt, John C., “Fast Training of Support Vector Machines using Sequential Minimal Optimization,” Advances in Kernel Methods—Support Vector Learning, MIT Press, 1999, pp. 185-208.
Sastry, P. S., “An introduction to Support Vector Machines,” Published as a Chapter in J.C. Misra (ed), Computing and Information Sciences: Recent Trends, Narosa Publishing House, New Delhi, 2003, pp. 1-44.
Sebastiani, Fabrizio, “Machine Learning in Automated Text Categorization,” ACM Computing Surveys, vol. 34, No. 1, Mar. 2002, ©2002 ACM, pp. 1-47.
Sun, Aixin and Ee-Peng Lim, “Hierarchical Text Classification and Evaluation,” ICDM, 2001, pp. 521-528.
Yang, Huai-Yuan et al., “Heterogeneous Information Integration in Hierarchical Text Classification,” PAKDD, 2006, pp. 240-249.
Yang, Yiming and Xin Liu, “A re-examination of text categorization methods,” SIGIR'99, Berkley, California, ©1999 ACM, pp. 42-49.
Yang, Yiming, “A Study on Thresholding Strategies for Text Categorization,” SIGIR'01, New Orleans, Louisiana, ©2001 ACM, pp. 137-145.
Yang, Yiming, Jian Zhang and Bryan Kisiel, “A Scalability Analysis of Classifiers in Text Categorization,” SIGIR'03, Toronto, Canada, ©2003 ACM, pp. 96-103.
Lewis, David D., “Reuters-21578”, Test Collections, 1 page, WayBackMachine: “http://web.archive.org/web/20040604003920/http://www.daviddlewis.com/resources/testcollections/reuters21578/”, Jun. 4, 2004.
Liu Tie-Yan
Ma Wei-Ying
Zeng Hua-Jun
Ali Mohammad
Microsoft Corporation
Perkins Coie LLP
Shmatov Alexey
LandOfFree
Distributed hierarchical text classification framework does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Distributed hierarchical text classification framework, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Distributed hierarchical text classification framework will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4221487