Method for determining the resemblance of documents

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707203, 707201, 707 1, G06F 1730

Patent

active

059096772

ABSTRACT:
A method for facilitating the comparison of two computerized documents. The method includes loading a first document into a random access memory (RAM), loading a second document into the RAM, reducing the first document into a first sequence of tokens, reducing the second document into a second sequence of tokens, converting the first set of tokens to a first (multi)set of shingles, converting the second set of tokens to a second (multi)set of shingles, determining a first sketch of the first (multi)set of shingles, determining a second sketch of the second (multi)set of shingles, and comparing the first sketch and the second sketch. The sketches have a fixed size, independent of the size of the documents. The resemblance of two documents is provided using a sketch of each document. The sketches may be computed fairly fast and given two sketches the resemblance of the corresponding documents can be computed in linear time in the size of the sketches.

REFERENCES:
patent: 5442780 (1995-08-01), Takanashi et al.
patent: 5544049 (1996-08-01), Henderson et al.
patent: 5557249 (1996-09-01), Califano
patent: 5778363 (1998-07-01), Light

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for determining the resemblance of documents does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for determining the resemblance of documents, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for determining the resemblance of documents will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-962518

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.