Overlapping subdocuments in a vector space search process

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707500, G06F 1730

Patent

active

059078405

ABSTRACT:
The present invention is a method and apparatus for retrieving information from a database. Initially, the documents within the database are divided into mutually exclusive subdocuments that generally correspond to paragraphs of text. The present invention further creates a second set of subdocuments that overlap adjacent paragraphs of text. In particular, the location of the overlapping subdocuments depends on the size of the initial paragraphs. This second set of overlapping subdocuments are scored just as the mutually exclusive subdocuments are scored. The scores from both the mutually exclusive and overlapping subdocuments are used in ranking the relevance of documents to a query. The use of both sets of subdocument scores improves the effectiveness of the scoring algorithm.

REFERENCES:
patent: 5642502 (1997-06-01), Driscoll
patent: 5724567 (1998-03-01), Rose et al.
patent: 5724571 (1998-03-01), Woods
patent: 5794178 (1998-08-01), Caid et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Overlapping subdocuments in a vector space search process does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Overlapping subdocuments in a vector space search process, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Overlapping subdocuments in a vector space search process will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-409037

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.