Data processing: database and file management or data structures – Database design – Data structure types
Patent
1996-03-15
1999-07-20
Black, Thomas G.
Data processing: database and file management or data structures
Database design
Data structure types
707 3, 707 4, 707 7, 707513, 707532, G06F 1721
Patent
active
059268115
ABSTRACT:
A statistical thesaurus is built dynamically, from the same text collection that is being searched, allowing improved generation of expanded query terms. The thesaurus is dynamic in that thesaurus records are collected, ranked, accessed, and applied dynamically. Thesaurus "records" are actually formed as indexed documents arranged in "collections". The collections are preferably distinguished based on text source (court cases versus news wires versus patents, and so forth). Each record has terms assembled in indexed groups (or segments) which inherently reflect a ranking based on relevance to an initial query. After an initial query is received, the appropriate collection(s) of records may be searched by a conventional search and retrieval engine, the searches inherently returning records ranked by degree of relevance due to the record indexing scheme. A record ranking scheme avoids contamination of relevant records by less relevant records. The record selection and the expansion query term generation processes are each divided into parallel threads. The separate threads correspond to respective text sources to enable the improved expansion query term generation to be provided in real time.
REFERENCES:
patent: 4870568 (1989-09-01), Kahle et al.
patent: 4876643 (1989-10-01), McNeill et al.
patent: 5136289 (1992-08-01), Yoshida et al.
patent: 5297039 (1994-03-01), Kanaegami et al.
patent: 5410475 (1995-04-01), Lu et al.
patent: 5469355 (1995-11-01), Tsuzuki
patent: 5481742 (1996-01-01), Worley et al.
patent: 5615378 (1997-03-01), Nishino et al.
patent: 5619709 (1997-04-01), Caid et al.
patent: 5675819 (1997-10-01), Schuette
patent: 5717914 (1998-02-01), Husick et al.
patent: 5721902 (1998-02-01), Schultz
Ahlswede, Thomas, et al., "Automatic Construction of a Phrasal Thesaurus for an Information Retrieval System from a Machine Readable Dictionary", Proceedings of RIAO '88, Cambridge, Massachusetts, Mar. 1988, pp. 597-608.
Salton, Gerard, et al., "*B Automatic Thesaurus Construction", from Chapter 3 of Introduction to Modern Information Retrieval, McGraw-Hill, New York, 1983, pp. 78-81.
Peat, Helen J., et al., "The Limitations of Term Co-Occurrence Data for Query Expansion in Document Retrieval Systems", Journal of the American Society for Information Science, vol. 42, No. 5, 1991, pp. 378-383.
Minker, Jack et al., "An Evaluation of Query Expansion by the Addition of Clustered Terms for a Document Retrieval System", Information Storage & Retrieval, Pergamon Press, Great Britain, vol. 8, 1972, pp. 329-348.
Crouch, Carolyn J., et al., "Experiments in Automatic Statistical Thesaurus Construction", Department of Computer Science, University of Minnesota, Duluth, Proceedings of the 15th International ACM-SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, Denmark, 1992, pp. 77-88.
Crouch, C. J., "An Approach to the Automatic Construction of Global Thesauri", Information Processing & Management, Pergamon Press plc, Great Britain, vol. 26, No. 5, 1990, pp. 629-640.
Lesk, M. E., Division of Engineering and Applied Physics, Harvard University, "Word-Word Associations in Document Retrieval Systems", American Documentation, vol. 20, Jan. 1969, pp. 27-38.
Holt John David
Lu Xin Allan
Miller David James
Black Thomas G.
Homere Jean R.
Lexis-Nexis
LandOfFree
Statistical thesaurus, method of forming same, and use thereof i does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Statistical thesaurus, method of forming same, and use thereof i, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Statistical thesaurus, method of forming same, and use thereof i will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1331745