Generating a fingerprint for a document

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

Reexamination Certificate

Status: active

Patent number: 07555489

ABSTRACT:
Mechanisms for generating a set of one or more elements of a fingerprint for a document, the document comprising a semantic construct having one or more ordered words, are provided. With these mechanisms, a range of sizes for a fingerprint element is defined and ordered words of the semantic construct are divided into a set of one or more mutually exclusive fingerprint elements. Each of the one or more mutually exclusive fingerprint elements includes a number of adjacent words, the number being within the range of sizes for a fingerprint element. Responsive to a determination that the set of mutually exclusive fingerprint elements excludes a word from the semantic construct, the excluded word is discarded.
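
The division step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how element sizes are chosen within the range, so this example assumes a simple greedy policy: take as many adjacent words as the maximum size allows, and stop once fewer than the minimum number of words remain. The function name `fingerprint_elements` and the parameters `min_size` and `max_size` are hypothetical and are not taken from the patent.

```python
from typing import List, Sequence, Tuple


def fingerprint_elements(
    words: Sequence[str], min_size: int, max_size: int
) -> Tuple[List[Tuple[str, ...]], List[str]]:
    """Divide ordered words into mutually exclusive elements of adjacent words.

    Assumption (not from the patent): a greedy policy that prefers the
    largest allowed element size. Words left over that cannot form an
    element of at least min_size are returned as discarded.
    """
    if not 1 <= min_size <= max_size:
        raise ValueError("require 1 <= min_size <= max_size")

    elements: List[Tuple[str, ...]] = []
    i = 0
    # Keep forming elements while enough words remain for a valid element.
    while len(words) - i >= min_size:
        size = min(max_size, len(words) - i)
        elements.append(tuple(words[i:i + size]))
        i += size

    # Any trailing words are excluded from every element and are discarded.
    discarded = list(words[i:])
    return elements, discarded


if __name__ == "__main__":
    sentence = "the quick brown fox jumps over the lazy dog again".split()
    elems, dropped = fingerprint_elements(sentence, min_size=3, max_size=4)
    print(elems)    # two elements of four adjacent words each
    print(dropped)  # the two trailing words that fit no element
```

With a range of 3 to 4 words, the ten-word sentence yields two four-word elements, and the last two words are discarded because they cannot form an element within the range. A greedy policy is only one possible choice; a different policy (for example sizes 4, 3, 3) could cover the same sentence without discarding any word.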

REFERENCES:
patent: 6006223 (1999-12-01), Agrawal et al.
patent: 6167369 (2000-12-01), Schulze
patent: 6349296 (2002-02-01), Broder et al.
non-patent: Broder, Glassman, Manasse, “Syntactic Clustering of the Web” (discloses shingling), http://decweb.ethz.ch/WWW6/Technical/Paper205/Paper205.html
