Hierarchical presearch type text search method and apparatus and

Image analysis – Histogram processing – For setting a threshold

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

G10L 302

Patent

active

051685337

ABSTRACT:
A method and apparatus for making document information search and a magnetic disk unit to be used for realizing the method and apparatus. In the document information search method, in performing document search with respect to a desired subject key word, two stages of presearch are carried out. In a first stage of presearch (step 402), a character component table (500) in which existence of character codes for every document is stated with respect to all the character codes contained in the group of document text data of stored documents is generated, and the character component table is searched for all the character codes constituting a desiredly designated search subject key word to thereby extract all the documents each containing all the character codes constituting the search subject key word. In a second stage of presearch step 403), contracted text data for every document in which adjuncts and duplication of repeatedly stated words contained in advance in the text data are eliminated is generated, and the documents each containing the search subject key words by word are extracted from the documents extracted by the first presearch. After the second stage of presearch, text search is performed in accordance with a neighbor condition, a contextual condition, or the like (step 404). Further, as a term comparator means, hardware (1106) for exclusive use for term comparison in accordance with a finite automation is employed. Further, as for different notation and synonym, inputted terms are once subject to different notation development in a different notation development processing portion (2601), each of the different-notation developed terms is subject to synonym development in a synonym development processing portion (2602) while referring to a synonym dictionary, and then the results of synonym development are further subject to different notation development in a different notation development processing portion (2603) in accordance with a conversion rule table (2603).

REFERENCES:
patent: 4320451 (1982-03-01), Bachman et al.
patent: 4395757 (1983-07-01), Bienvenu et al.
patent: 4418385 (1983-11-01), Bourrez
patent: 4430699 (1984-02-01), Segarra et al.
patent: 4539655 (1985-09-01), Trussell et al.
patent: 4589065 (1986-05-01), Auslander et al.
patent: 4635189 (1987-01-01), Kendall
patent: 4870704 (1989-09-01), Matelan et al.
Roger L. Haskin, et al., "Operational Characteristics of a Hardware-Based Pattern Matcher", ACM Transactions on Database Systems, vol. 8, No. 1, Mar. 1983, pp. 15-40.
Alfred V. Aho, et al., "Efficient String Matching: An Aid to Bibliographic Search", Communications of the ACM, vol. 18, no. 6, Jun. 1975, pp. 333-340.
Haskin and A. Hollaar: "Operational Characteristics of a Hardware-Based Pattern Matcher", ACM Trans. on Database System, vol. 8, No. 1, 1983.
A. V. Aho and M. J. Corasick: "Efficient String Matching", CACM, vol. 18, No. 6, 1975.
JP-A-60-105039.
JP-A-60-105040.
JP-A-63-311530.
JP-A-62-011932.
JP-A-60-117326.
JP-A-62-241026.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Hierarchical presearch type text search method and apparatus and does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Hierarchical presearch type text search method and apparatus and, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Hierarchical presearch type text search method and apparatus and will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-507921

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.