Image analysis – Pattern recognition – Context analysis or word recognition
Patent
1995-11-09
1996-06-11
Couso, Jose L.
Image analysis
Pattern recognition
Context analysis or word recognition
382224, 382173, 395600, 36441908, G06K 972
Patent
active
055264433
ABSTRACT:
Highlighting and categorization of documents is carried out by using word tokens which represent words appearing in a document. Elimination of certain unimportant word tokens is first completed, after which the remaining words of the document are ranked according to their word token appearance rates. These rates are then used to highlight frequently appearing words in the document which indicate the document's topic. The document can also be categorized using document profiles developed from the word tokens.
REFERENCES:
patent: 4907283 (1990-03-01), Tanaka et al.
patent: 5375176 (1994-12-01), Spitz
patent: 5384863 (1995-01-01), Huttenlocher et al.
Cavnar, William & Trenkle, John, "N-Gram-Based Text Categorization," Environmental Research Institute of Michigan, pp. 161-175, 1994.
Couso Jose L.
Do Anh H.
Fuji 'Xerox Co., Ltd.
Xerox Corporation
LandOfFree
Method and apparatus for highlighting and categorizing documents does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for highlighting and categorizing documents, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for highlighting and categorizing documents will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-360298