Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Patent
1999-08-03
2000-12-12
Thomas, Joseph
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
704 10, 707 2, 707 5, G06F 1727, G06F 1730
Patent
active
061610844
ABSTRACT:
The present invention is directed to performing information retrieval utilizing semantic representation of text. In a preferred embodiment, a tokenizer generates from an input string information retrieval tokens that characterize the semantic relationship expressed in the input string. The tokenizer first creates from the input string a primary logical form characterizing a semantic relationship between selected words in the input string. The tokenizer then identifies hypernyms that each have an "is a" relationship with one of the selected words in the input string. The tokenizer then constructs from the primary logical form one or more alternative logical forms. The tokenizer constructs each alternative logical form by, for each of one or more of the selected words in the input string, replacing the selected word in the primary logical form with an identified hypernym of the selected word. Finally, the tokenizer generates tokens representing both the primary logical form and the alternative logical forms. The tokenizer is preferably used to generate tokens for both constructing an index representing target documents and processing a query against that index.
REFERENCES:
patent: 4839853 (1989-06-01), Deerwester et al.
patent: 5146406 (1992-09-01), Jensen
patent: 5278980 (1994-01-01), Pedersen et al.
patent: 5325298 (1994-06-01), Gallant
patent: 5386556 (1995-01-01), Hedin et al.
patent: 5454106 (1995-09-01), Burns et al.
patent: 5592661 (1997-01-01), Eisenberg et al.
patent: 5619709 (1997-04-01), Caid et al.
patent: 5630121 (1997-05-01), Braden-Harder et al.
patent: 5675745 (1997-10-01), Oku et al.
patent: 5675819 (1997-10-01), Schuetze
patent: 5794050 (1998-08-01), Dahlgren et al.
patent: 5794178 (1998-08-01), Caid et al.
patent: 5799308 (1998-08-01), Dixon
patent: 5873056 (1999-02-01), Liddy et al.
patent: 5893104 (1999-04-01), Srinivasan et al.
patent: 5895464 (1999-04-01), Bhandari et al.
patent: 5933822 (1999-08-01), Braden-Harder et al.
patent: 5963940 (1999-10-01), Liddy et al.
patent: 5966686 (1999-10-01), Heidorn et al.
patent: 5970490 (1999-10-01), Morgenstern
patent: 5991713 (1999-11-01), Unger et al.
patent: 5995922 (1999-11-01), Penteroudakis et al.
patent: 6038561 (2000-03-01), Snyder et al.
patent: 6070134 (2000-05-01), Richardson et al.
patent: 6076051 (2000-06-01), Messerly et al.
Gerard Salton, "Automatic Information Organization and Retrieval," McGraw Hill Book Company, pp. 168-178 (1968).
Fagan, Joel L., PhD., "Experiments in automatic phrase indexing for document retrieval: A comparision of syntactic and non-syntatic methods," Cornell University, UMI Dissertation Information Service, pp. 1-261 (1987).
James Allen, "Natural Language Understanding," The Benjamin/Cummings Publishing Company, Inc., Chapter 8, pp. 227-238 (1995).
Van Zuijlen, Job M., "Probabilistic Methods in Dependency Grammer Parsing," International Parsing Workshop, 10375:142-151 (1989).
Dolan William B.
Heidorn George E.
Jensen Karen
Messerly John J.
Richardson Stephen D.
Microsoft Corporation
Thomas Joseph
LandOfFree
Information retrieval utilizing semantic representation of text does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Information retrieval utilizing semantic representation of text , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Information retrieval utilizing semantic representation of text will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-226003