Identification of files with similar content

Data processing: database and file management or data structures – Data integrity – Using checksum

Reexamination Certificate

Details

C707S747000, C707S749000

active

07814078

ABSTRACT:
A method, apparatus, and system identify files with similar content. One embodiment is a method that divides files into plural segments. The method computes a hash value and a size for each of the plural segments of the files. In order to identify which files have similar content, the method adds together segments common between files. File similarity information of files with similar content is output.
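
The steps in the abstract can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented method: the fixed-size segmentation, the choice of SHA-256, the reading of "adds together segments" as summing the sizes of shared segments, and the names `segment_signatures` and `shared_bytes` are all hypothetical and are not taken from the patent text.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def segment_signatures(data: bytes, segment_size: int = 4096) -> set:
    """Split data into fixed-size segments (an assumption) and return
    the set of (hash, size) pairs, one per distinct segment."""
    signatures = set()
    for start in range(0, len(data), segment_size):
        segment = data[start:start + segment_size]
        signatures.add((hashlib.sha256(segment).hexdigest(), len(segment)))
    return signatures


def shared_bytes(files: dict, segment_size: int = 4096) -> dict:
    """For each pair of files, add up the sizes of the segments they share.

    `files` maps a file name to its content as bytes. The result maps
    (name_a, name_b) pairs, sorted by name, to the total shared size.
    """
    # Index every (hash, size) signature by the files that contain it.
    owners = defaultdict(set)
    for name, data in files.items():
        for signature in segment_signatures(data, segment_size):
            owners[signature].add(name)

    # Every pair of files holding the same segment gains that segment's size.
    totals = defaultdict(int)
    for (_digest, size), names in owners.items():
        for a, b in combinations(sorted(names), 2):
            totals[(a, b)] += size
    return dict(totals)


if __name__ == "__main__":
    demo = {
        "a.txt": b"A" * 8 + b"B" * 8,
        "b.txt": b"A" * 8 + b"C" * 8,
        "c.txt": b"D" * 8,
    }
    # a.txt and b.txt share one 8-byte segment; c.txt shares nothing.
    print(shared_bytes(demo, segment_size=8))
```

Because each file's segments are collected in a set, a segment repeated within one file is counted once. Pairs with no common segment are simply absent from the result, so the output lists only files with some similar content.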

REFERENCES:
patent: 2005/0091234 (2005-04-01), Hsu et al.
patent: 2005/0216813 (2005-09-01), Cutts et al.
patent: 2006/0253476 (2006-11-01), Roth et al.
Quinlan et al., “Venti: A New Approach to Archival Storage”, Jan. 2002, Proceedings of FAST 2002 Conference on File and Storage Technologies, pp. 1-14.
Udi Manber, “Finding Similar Files in a Large File System”, TR-93-33, Oct. 1993, Department of Computer Science, University of Arizona, pp. 1-10.
Val Henson, “Guidelines for Using Compare-by-hash”, IBM, Inc., Red Hat, Inc., pp. 1-14.
