Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2006-04-04
2006-04-04
Corrielus, Jean M. (Department: 2162)
C707S793000, C704S007000, C704S009000
active
07024408
ABSTRACT:
Disclosed are computer-readable code, a system, and a method for classifying a target document, in the form of a digitally encoded natural-language text, as belonging to one or more of two or more different classes. For each of a plurality of non-generic words and/or word groups (terms) characterizing the target document, a selectivity value is determined, calculated as the frequency of occurrence of that term in a library of texts in one field relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term is a function of the selectivity value determined for that term. For each of a plurality of sample texts having associated classification identifiers, a match score is then determined that is related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts and their associated classification identifiers, a classification determination of the target document is made.
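The steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not give the exact selectivity formula, the coefficient function, or the rule for combining matched texts, so the choices below are assumptions made only for illustration. Specifically, selectivity is taken as the highest ratio of a term's occurrence rate in one field library to its mean rate in the other libraries, the vector coefficient is the selectivity value itself, the match score is the sum of coefficients of shared terms, and the class is chosen by score-weighted vote over the top-scoring sample texts. All names (GENERIC, terms, term_rates, selectivity, term_vector, classify, top_n, floor) are hypothetical, and only single words are handled, not word groups.

```python
import re
from collections import Counter, defaultdict

# Tiny illustrative list of generic words; a real list would be far larger (assumption).
GENERIC = {"a", "an", "the", "of", "in", "and", "or", "to", "is", "for", "with"}


def terms(text):
    """Lowercase word tokens with the generic words removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in GENERIC]


def term_rates(library):
    """Occurrence rate of each term (occurrences per token) over a library of texts."""
    counts = Counter(t for text in library for t in terms(text))
    total = sum(counts.values()) or 1
    return {t: c / total for t, c in counts.items()}


def selectivity(term, rates_by_field, floor=1e-6):
    """Highest ratio of the term's rate in one field to its mean rate in the other fields.

    `floor` avoids division by zero when a term is absent elsewhere (assumption).
    """
    best = 0.0
    for field, rates in rates_by_field.items():
        others = [r.get(term, 0.0) for f, r in rates_by_field.items() if f != field]
        mean_other = sum(others) / len(others) if others else 0.0
        best = max(best, rates.get(term, 0.0) / max(mean_other, floor))
    return best


def term_vector(text, rates_by_field):
    """Represent a text as {term: coefficient}; here the coefficient is the selectivity."""
    return {t: selectivity(t, rates_by_field) for t in set(terms(text))}


def classify(target, samples, rates_by_field, top_n=3):
    """Score each (text, class_id) sample against the target and vote among the top matches."""
    target_vec = term_vector(target, rates_by_field)
    scored = []
    for text, class_id in samples:
        shared = set(terms(text)) & target_vec.keys()
        scored.append((sum(target_vec[t] for t in shared), class_id))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = defaultdict(float)
    for score, class_id in scored[:top_n]:
        votes[class_id] += score
    return max(votes, key=votes.get)
```

To use the sketch, build one rate table per field library with term_rates, collect them in a dictionary keyed by field name, and pass that dictionary to classify together with the target text and a list of (sample text, classification identifier) pairs. A term that appears often in one field and rarely elsewhere receives a large coefficient, so sample texts sharing such terms with the target dominate the vote.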
REFERENCES:
patent: 4554631 (1985-11-01), Reddington
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5694592 (1997-12-01), Driscoll
patent: 5745889 (1998-04-01), Burrows
patent: 5745890 (1998-04-01), Burrows
patent: 5752051 (1998-05-01), Cohen
patent: 5867811 (1999-02-01), O'Donoghue
patent: 5873056 (1999-02-01), Liddy et al.
patent: 5893102 (1999-04-01), Maimone et al.
patent: 5915249 (1999-06-01), Spencer
patent: 5937422 (1999-08-01), Nelson et al.
patent: 5983171 (1999-11-01), Yokoyama et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6006223 (1999-12-01), Agrawal et al.
patent: 6009397 (1999-12-01), Siegel
patent: 6081774 (2000-06-01), de Hita et al.
patent: 6088692 (2000-07-01), Driscoll
patent: 6216102 (2001-04-01), Martino et al.
patent: 6275801 (2001-08-01), Novak et al.
patent: 6279017 (2001-08-01), Walker
patent: 6374210 (2002-04-01), Chu
patent: 6393389 (2002-05-01), Chanod et al.
patent: 6415250 (2002-07-01), van den Akker
patent: 6529902 (2003-03-01), Kanevsky et al.
patent: 6574632 (2003-06-01), Fox et al.
patent: 6633868 (2003-10-01), Min et al.
patent: 6665668 (2003-12-01), Sugaya et al.
patent: 6669091 (2003-12-01), Sharpe et al.
patent: 6687689 (2004-02-01), Fung et al.
patent: 6741959 (2004-05-01), Kaiser
patent: 2002/0022974 (2002-02-01), Lindh
patent: 2002/0052901 (2002-05-01), Guo et al.
patent: 2003/0026459 (2003-02-01), Won et al.
patent: 2003/0028566 (2003-02-01), Nakano
patent: 2004/0015481 (2004-01-01), Zinda
patent: 2004/0024733 (2004-02-01), Won et al.
patent: 2004/0111388 (2004-06-01), Boiscuvier et al.
patent: 2004/0186833 (2004-09-01), Watts
patent: 2004/0230568 (2004-11-01), Budzyn
patent: 0 524 385 (1993-01-01)
patent: 0 597 630 (1994-05-01)
patent: 0 813 158 (1997-12-01)
patent: 1 011 056 (2000-06-01)
patent: 1 049 030 (2000-11-01)
patent: 1 168 202 (2002-01-01)
patent: 2264186 (1993-08-01)
patent: WO 99/10819 (1999-03-01)
patent: WO 03/079231 (2003-09-01)
Strzalkowski, T. et al., “Natural language information retrieval in digital libraries”, ACM 117-125, 1996.
Michael, J. B. et al., “Natural-language processing support for developing policy-governed software systems”, 39th Intl. Conf. on Techn. for Object-oriented Lang. and Syst., IEEE Computer Soc. Press, pp. 263-274, Jul. 2001.
Lin, D. and Pantel, P., “Induction of Semantic Classes from Natural Language Text”, KDD, ACM 2001, 6 pages.
Berg, G., “A connectionist Parser with Recursive Sentence Structure and lexical Disambiguation”, Proc. Tenth National Conf. on Artificial Intelligence (AAAI-92), 1992, 6 pages.
Niwa, Y. et al., “Patent Search: A Case Study of Cross-DB Associative Search”, Proc. of the Third NTCIR Workshop, 2003 Natl. Inst. of Informatics, 7 pages.
Larkey, L., “A Patent Search and Classification System”, Proc. of DL-99, 4th ACM Conference on Digital Libraries, 1999, 9 pages.
Cohen, W., “Text Categorization and Relational Learning”, Proc. of 12th Intl. Conference (ML95) on Machine Learning, 1995, 9 pages.
Krahmer, E. and Theune, M., “Context Sensitive Generation of Descriptions”, 1998, 4 pages.
Meyer, H. et al., “The Xircus Search Engine”, Univ. of Rostock, Database Research Group, 2003, 6 pages.
Ford, G. et al., “Pattern matching techniques for correcting low confidence OCR words in a known context”, Natl. Library of Medicine, Bethesda, Maryland 20894, 9 pages.
Chin Shao
Dehlinger Peter J.
Corrielus Jean M.
Dehlinger Peter J.
Word Data Corp.
Text-classification code, system and method