Method of generating a distributed text index for parallel...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

07324988

ABSTRACT:
The present invention relates to a method of generating a distributed text index for parallel query processing by a number of nodes. A set of node indices is generated for text indexing a set of documents, each node text index covering a subset of the documents. For each node text index, a local frequency measure for each term of the node text index is calculated on the basis of a frequency of documents containing the term in the subset of the documents of the node. A global frequency measure for each term is calculated on the basis of a frequency of documents containing the term in the set of documents. A quality measure for each node text index is calculated on the basis of the local frequency measures of the terms of the node and the global frequency measure of the terms of the node.

REFERENCES:
patent: 2004/0002973 (2004-01-01), Chaudhuri et al.
patent: 2004/0172378 (2004-09-01), Shanahan et al.
patent: 2004/0199419 (2004-10-01), Kim et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of generating a distributed text index for parallel... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of generating a distributed text index for parallel..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of generating a distributed text index for parallel... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2749916

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.