Statistical thesaurus, method of forming same, and use thereof i

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707 3, 707 4, 707 7, 707513, 707532, G06F 1721

Patent

active

059268115

ABSTRACT:
A statistical thesaurus is built dynamically, from the same text collection that is being searched, allowing improved generation of expanded query terms. The thesaurus is dynamic in that thesaurus records are collected, ranked, accessed, and applied dynamically. Thesaurus "records" are actually formed as indexed documents arranged in "collections". The collections are preferably distinguished based on text source (court cases versus news wires versus patents, and so forth). Each record has terms assembled in indexed groups (or segments) which inherently reflect a ranking based on relevance to an initial query. After an initial query is received, the appropriate collection(s) of records may be searched by a conventional search and retrieval engine, the searches inherently returning records ranked by degree of relevance due to the record indexing scheme. A record ranking scheme avoids contamination of relevant records by less relevant records. The record selection and the expansion query term generation processes are each divided into parallel threads. The separate threads correspond to respective text sources to enable the improved expansion query term generation to be provided in real time.

REFERENCES:
patent: 4870568 (1989-09-01), Kahle et al.
patent: 4876643 (1989-10-01), McNeill et al.
patent: 5136289 (1992-08-01), Yoshida et al.
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5410475 (1995-04-01), Lu et al.
patent: 5469355 (1995-11-01), Tsuzuki
patent: 5481742 (1996-01-01), Worley et al.
patent: 5615378 (1997-03-01), Nishino et al.
patent: 5619709 (1997-04-01), Caid et al.
patent: 5675819 (1997-10-01), Schuette
patent: 5717914 (1998-02-01), Husick et al.
patent: 5721902 (1998-02-01), Schultz
Ahlswede, Thomas, et al., "Automatic Construction of a Phrasal Thesaurus for an Information Retrieval System from a Machine Readable Dictionary", Proceedings of RIAO '88, Cambridge, Massachusetts, Mar. 1988, pp. 597-608.
Salton, Gerard, et al., "*B Automatic Thesaurus Construction", from Chapter 3 of Introduction to Modern Information Retrieval, McGraw-Hill, New York, 1983, pp. 78-81.
Peat, Helen J., et al., "The Limitations of Term Co-Occurrence Data for Query Expansion in Document Retrieval Systems", Journal of the American Society for Information Science, vol. 42, No. 5, 1991, pp. 378-383.
Minker, Jack et al., "An Evaluation of Query Expansion by the Addition of Clustered Terms for a Document Retrieval System", Information Storage & Retrieval, Pergamon Press, Great Britain, vol. 8, 1972, pp. 329-348.
Crouch, Carolyn J., et al., "Experiments in Automatic Statistical Thesaurus Construction", Department of Computer Science, University of Minnesota, Duluth, Proceedings of the 15th International ACM-SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, Denmark, 1992, pp. 77-88.
Crouch, C. J., "An Approach to the Automatic Construction of Global Thesauri", Information Processing & Management, Pergamon Press plc, Great Britain, vol. 26, No. 5, 1990, pp. 629-640.
Lesk, M. E., Division of Engineering and Applied Physics, Harvard University, "Word-Word Associations in Document Retrieval Systems", American Documentation, vol. 20, Jan. 1969, pp. 27-38.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Statistical thesaurus, method of forming same, and use thereof i does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Statistical thesaurus, method of forming same, and use thereof i, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Statistical thesaurus, method of forming same, and use thereof i will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1331745

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.