Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2006-02-21
Corrielus, Jean M. (Department: 2162)
C707S793000, C704S007000, C704S009000
active
07003516
ABSTRACT:
A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining, for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term. The selectivity value is related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term and is optionally also related to the inverse document frequency of that term in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.
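The representation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything specific here is an assumption made for illustration: the `GENERIC_WORDS` stop list, measuring frequency as the fraction of texts in a library that contain the term, defining selectivity as the target-field rate divided by the mean rate in the other fields (with a hypothetical `floor` to avoid division by zero), using the selectivity value itself as the coefficient, and the smoothed inverse-document-frequency form. Word groups are omitted for brevity. All function names are hypothetical and are not taken from the patent.

```python
import math

# Hypothetical stop list; the abstract does not enumerate "generic" words.
GENERIC_WORDS = {"a", "an", "the", "of", "in", "for", "and", "is", "to"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop generic words."""
    words = (w.strip(".,;:()\"'").lower() for w in text.split())
    return [w for w in words if w and w not in GENERIC_WORDS]


def texts_containing(term, library):
    """Number of texts in a library in which the term occurs at least once."""
    return sum(1 for text in library if term in set(tokenize(text)))


def term_rate(term, library):
    """Fraction of texts in a library that contain the term."""
    return texts_containing(term, library) / len(library) if library else 0.0


def selectivity(term, target_library, other_libraries, floor=1e-3):
    """Assumed form: rate in the target field over mean rate in other fields."""
    target = term_rate(term, target_library)
    others = [term_rate(term, lib) for lib in other_libraries]
    mean_other = sum(others) / len(others) if others else 0.0
    return target / max(mean_other, floor)


def inverse_document_frequency(term, library):
    """Assumed smoothed IDF: log((1 + N) / (1 + n_t)) + 1."""
    n_t = texts_containing(term, library)
    return math.log((1 + len(library)) / (1 + n_t)) + 1.0


def document_vector(document, target_library, other_libraries, use_idf=False):
    """Map each non-generic term of the document to its coefficient."""
    vector = {}
    for term in set(tokenize(document)):
        coeff = selectivity(term, target_library, other_libraries)
        if use_idf:  # optional weighting mentioned in the abstract
            coeff *= inverse_document_frequency(term, target_library)
        vector[term] = coeff
    return vector


# Tiny made-up libraries, for illustration only.
surgery = ["the stent is placed in the artery", "a catheter guides the stent"]
computing = ["the server stores the data", "data is sent to the server"]
vector = document_vector("the stent stores data", surgery, [computing])
```

In this toy setup a term that occurs only in the target-field library ("stent") receives a large coefficient, while terms that occur only in the other field ("stores", "data") receive a coefficient of zero, which is the field-discriminating behaviour the abstract attributes to the selectivity value.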
REFERENCES:
patent: 4554631 (1985-11-01), Reddington
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5694592 (1997-12-01), Driscoll
patent: 5745889 (1998-04-01), Burrows
patent: 5745890 (1998-04-01), Burrows
patent: 5752051 (1998-05-01), Cohen
patent: 5867811 (1999-02-01), O'Donoghue
patent: 5873056 (1999-02-01), Liddy et al.
patent: 5893102 (1999-04-01), Maimone et al.
patent: 5915249 (1999-06-01), Spencer
patent: 5937422 (1999-08-01), Nelson et al.
patent: 5983171 (1999-11-01), Yokoyama et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6006223 (1999-12-01), Agrawal et al.
patent: 6009397 (1999-12-01), Siegel
patent: 6081774 (2000-06-01), de Hita et al.
patent: 6088692 (2000-07-01), Driscoll
patent: 6216102 (2001-04-01), Martino et al.
patent: 6275801 (2001-08-01), Novak et al.
patent: 6279017 (2001-08-01), Walker
patent: 6374210 (2002-04-01), Chu
patent: 6393389 (2002-05-01), Chanod et al.
patent: 6415250 (2002-07-01), van den Akker
patent: 6529902 (2003-03-01), Kanevsky et al.
patent: 6633868 (2003-10-01), Min et al.
patent: 6665668 (2003-12-01), Sugaya et al.
patent: 6669091 (2003-12-01), Sharpe et al.
patent: 6687689 (2004-02-01), Fung et al.
patent: 2002/0022974 (2002-02-01), Lindh
patent: 2002/0052901 (2002-05-01), Guo et al.
patent: 2003/0026459 (2003-02-01), Won et al.
patent: 2003/0028566 (2003-02-01), Nakano
patent: 2004/0015481 (2004-01-01), Zinda
patent: 2004/0024733 (2004-02-01), Won et al.
patent: 2004/0111388 (2004-06-01), Boiscuvier et al.
patent: 2004/0186833 (2004-09-01), Watts
patent: 2004/0230568 (2004-11-01), Budzyn
patent: 0 524 385 (1993-01-01), None
patent: 0 597 630 (1994-05-01), None
patent: 0 813 158 (1997-12-01), None
patent: 1 011 056 (2000-06-01), None
patent: 1 049 030 (2000-11-01), None
patent: 1 168 202 (2002-01-01), None
patent: 2264186 (1993-08-01), None
patent: WO 99/10819 (1999-03-01), None
patent: WO 03/079231 (2003-09-01), None
Strzalkowski, T. et al., “Natural language information retrieval in digital libraries”, ACM 117-125, 1996.
Michael, J.B. et al., “Natural-language processing support for developing policy-governed software systems”, 39th Intl. Conf. on Techn. for Object-oriented Lang. and Syst., IEEE Computer Soc. Press, pp. 263-274, Jul. 2001.
Lin, D. and Pantel, P., “Induction of Semantic Classes from Natural Language Text”, KDD, ACM 2001, 6 pages.
Berg, G., “A Connectionist Parser with Recursive Sentence Structure and Lexical Disambiguation”, Proc. Tenth National Conf. on Artificial Intelligence, AAAI-92, 1992, 6 pages.
Niwa, Y. et al., “Patent Search: A Case Study of Cross-DB Associative Search”, Proc. of the Third NTCIR Workshop, 2003 Natl. Inst. of Informatics, 7 pages.
Larkey, L., “A Patent Search and Classification System”, Proc. of DL-99, 4th ACM Conference on Digital Libraries, 1999, 9 pages.
Cohen, W., “Text Categorization and Relational Learning”, Proc. of 12th Intl. Conference (ML95) on Machine Learning, 1995, 9 pages.
Krahmer, E. and Theune, M., “Context Sensitive Generation of Descriptions”, 1998, 4 pages.
Meyer, H. et al., “The Xircus Search Engine”, Univ. of Rostock, Database Research Group, 2003, 6 pages.
Ford, G. et al., “Pattern matching techniques for correcting low confidence OCR words in a known context”, Natl. Library of Medicine, Bethesda, Maryland 20894, 9 pages.
Chin, Shao
Dehlinger, Peter J.
Corrielus, Jean M.
Perkins Coie LLP
Word Data Corp.
Text representation and method