System and method for portable document indexing using n-gram wo

Image analysis – Pattern recognition – Context analysis or word recognition

Patent

Details

36441919, G06K 972

Patent

active

057063656

ABSTRACT:
A system and method provide for indexing and retrieval of stored documents using a decomposition of the words in the documents into n-grams, or linear word subunits. The documents are indexed as pages in a number of banks. For each bank there is a bank index. The individual n-grams identified for each page are stored in the bank index. Each bank index further contains an entry map that indicates whether a given n-gram is present in any of the pages of the bank, and that provides an index to a page map, which in turn indicates which pages in the bank contain the n-gram. When a search query is input, the query words are decomposed into their n-grams. The query word n-grams are first compared with the entry maps to determine whether they appear on any page in the bank. If so, the associated page map is traversed to determine which pages in the bank contain the query word n-grams. The n-grams on each such page are compared with the query word n-grams to determine the presence of a match therebetween. Matching pages are flagged. When all pages in all banks have been processed, the flagged pages are consolidated with respect to the documents to which they belong, resulting in a list of documents that match the search query. The results are displayed to a user.
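The lookup flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `ngrams`, `BankIndex`, and `search` are hypothetical, trigrams (n = 3) are an assumed choice, and a plain dictionary stands in for both the entry map (is the n-gram present anywhere in the bank?) and the page map (which pages hold it?).

```python
from collections import defaultdict


def ngrams(word, n=3):
    """Split a word into overlapping n-grams; short words yield themselves."""
    word = word.lower()
    if len(word) <= n:
        return {word}
    return {word[i:i + n] for i in range(len(word) - n + 1)}


class BankIndex:
    """Hypothetical bank index: one entry per n-gram, mapped to page slots."""

    def __init__(self):
        self.page_ngrams = []             # page slot -> n-grams found on that page
        self.page_doc = []                # page slot -> id of the owning document
        self.page_map = defaultdict(set)  # n-gram -> page slots containing it

    def add_page(self, doc_id, text, n=3):
        slot = len(self.page_ngrams)
        grams = set()
        for word in text.split():
            grams |= ngrams(word, n)
        self.page_ngrams.append(grams)
        self.page_doc.append(doc_id)
        for gram in grams:
            self.page_map[gram].add(slot)

    def has_entry(self, gram):
        # Plays the role of the entry map: is the n-gram on any page of the bank?
        return gram in self.page_map

    def matching_pages(self, query_grams):
        # Step 1: consult the entry map; skip the bank if any n-gram is absent.
        if not all(self.has_entry(g) for g in query_grams):
            return set()
        # Step 2: walk the page maps to find pages holding every query n-gram.
        candidates = set.intersection(*(self.page_map[g] for g in query_grams))
        # Step 3: compare the page's own n-grams with the query n-grams.
        return {p for p in candidates if query_grams <= self.page_ngrams[p]}


def search(banks, query, n=3):
    """Flag matching pages in every bank, then consolidate them by document."""
    query_grams = set()
    for word in query.split():
        query_grams |= ngrams(word, n)
    if not query_grams:
        return []
    docs = set()
    for bank in banks:
        for slot in bank.matching_pages(query_grams):
            docs.add(bank.page_doc[slot])
    return sorted(docs)


bank1 = BankIndex()
bank1.add_page("docA", "portable document indexing")
bank1.add_page("docB", "image analysis")
bank2 = BankIndex()
bank2.add_page("docA", "pattern recognition")
bank2.add_page("docC", "word recognition index")

print(search([bank1, bank2], "recognition"))
```

Because matching works on n-grams rather than whole words, a query such as "index" also matches a page containing "indexing" in this sketch, and a page can match when the query n-grams come from different words. How the patent resolves such cases is not stated in the abstract.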

REFERENCES:
patent: 4495566 (1985-01-01), Dickinson et al.
patent: 5062142 (1991-10-01), Meckley
patent: 5062143 (1991-10-01), Schmitt
patent: 5265065 (1993-11-01), Turtle
patent: 5375235 (1994-12-01), Berry et al.
patent: 5412807 (1995-05-01), Moreland
patent: 5418951 (1995-05-01), Damashek
patent: 5469354 (1995-11-01), Hatakeyama et al.
Meltzer, Arnold C. and Kowalski, Gerald, "Text Searching Using An Inversion Database Consisting Of Trigrams", The Second International Conference on Computers and Applications, Beijing, People's Republic of China, Jun. 23-27, 1987.
Kimbrell, Roy E., "Searching for Text? Send an N-Gram!", Byte, pp. 297-312, May 1988.
Guarin, Clauda Jimenez, "Access By Content Of Documents In An Office Information System", 11th International Conference on Research & Development in Information Retrieval, Grenoble, France, Jun. 13-15, 1988.
