Automatic evaluation of categorization system quality

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

active

10455715

ABSTRACT:
A computerized method and system of document analysis that categorizes documents according to a taxonomy. Training documents are rated at a lower level by associating one of the following predicates with each training document: correct, inbound, outbound, or unassigned. Categories are rated at a lower level by determining precision/recall values for each category. Higher-level category rating attributes are then generated from the lower-level rating steps by associating with the categories one or more of: aa) weak category, bb) existing source/sink relationship between categories, cc) close categories. An overall quality measure for the training base is derived from the lower-level and higher-level rating steps, and the lower-level and higher-level evaluation results are stored. The quality measure is used to determine action proposals to improve the training base, as one or more of: aa) modifying the number of categories by adding a new category or deleting an existing category, bb) splitting a category into one or more new categories, cc) merging a category with another one, or dd) modifying the number of training documents of a category by adding or removing some of them. Optionally, a means is provided to carry out the above steps automatically and to review the results, including the ability to restore the previous state.
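The lower-level rating steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not define the predicates, so the sketch assumes one plausible reading: a training document is "correct" when the categorizer returns its own category, "outbound" for its own category and "inbound" for the receiving category when it lands elsewhere, and "unassigned" when no category is returned. The function names, the input format, and the 0.5 weakness threshold are all hypothetical. The overall quality measure and the action proposals are not covered.

```python
from collections import Counter, defaultdict


def rate_training_base(docs):
    """Rate training documents and categories (illustrative sketch).

    docs: iterable of (true_category, predicted_category_or_None) pairs.
    Returns (ratings, flows): per-category precision/recall, and a count
    of documents moving from one category to another (assumed here to be
    the raw material for source/sink relationships).
    """
    counts = defaultdict(Counter)  # category -> predicate counts
    flows = Counter()              # (source, sink) -> document count
    for true_cat, predicted in docs:
        if predicted is None:
            counts[true_cat]["unassigned"] += 1
        elif predicted == true_cat:
            counts[true_cat]["correct"] += 1
        else:
            counts[true_cat]["outbound"] += 1
            counts[predicted]["inbound"] += 1
            flows[(true_cat, predicted)] += 1

    ratings = {}
    for cat, c in counts.items():
        retrieved = c["correct"] + c["inbound"]
        relevant = c["correct"] + c["outbound"] + c["unassigned"]
        ratings[cat] = {
            "precision": c["correct"] / retrieved if retrieved else 0.0,
            "recall": c["correct"] / relevant if relevant else 0.0,
        }
    return ratings, flows


def weak_categories(ratings, threshold=0.5):
    """Assumed definition: precision or recall below the threshold."""
    return sorted(
        cat for cat, r in ratings.items()
        if min(r["precision"], r["recall"]) < threshold
    )
```

Called on a list of (true category, predicted category) pairs, `rate_training_base` returns the per-category precision/recall table together with the pairwise flows, and `weak_categories` picks out the categories that fall below the chosen threshold.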

REFERENCES:
patent: 6233575 (2001-05-01), Agrawal et al.
patent: 6389436 (2002-05-01), Chakrabarti et al.
patent: 6446061 (2002-09-01), Doerre et al.
patent: 2003/0033263 (2003-02-01), Cleary
patent: 2005/0114829 (2005-05-01), Robin et al.
Apte et al. Automated Learning of Decision Rules for Text Categorization, ACM Transactions on Information Systems, vol. 12, No. 3, Jul. 1994, pp. 233-251.
Sebastiani, Fabrizio. Machine Learning in Automated Text Categorization, ACM Computing Surveys, vol. 34, No. 1, Mar. 2002, pp. 1-47.
