Optimizing the performance of duplicate identification by...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C707S793000

Reexamination Certificate

active

07617195

ABSTRACT:
In accordance with the disclosure, there is provided a method for identifying duplicate documents comprising drafting a first document and creating a near unique representative string based on the document content. The method further comprises searching for other documents with the same NRS and selectively assigning a duplicate group identification to the first document, the duplicate group identification is unique if no near unique representative string matches are found, or the duplicate group identification is the same as an associated duplicate document's duplicate group identification that matches the NRS. The method further comprises placing the DGI into a meta-data of the first document and recalling a list of duplicates of a particular document based upon user demand by searching the meta-data and selecting documents using the same DGI.

REFERENCES:
patent: 5486686 (1996-01-01), Zdybel, Jr. et al.
patent: 5629980 (1997-05-01), Stefik
patent: 5634012 (1997-05-01), Stefik
patent: 5638443 (1997-06-01), Stefik
patent: 5715403 (1998-02-01), Stefik
patent: 5893908 (1999-04-01), Cullen
patent: 6041323 (2000-03-01), Kubota
patent: 6236971 (2001-05-01), Stefik
patent: 6615209 (2003-09-01), Gomes
patent: 6658423 (2003-12-01), Pugh
patent: 6792576 (2004-09-01), Chidlovskii
patent: 6917936 (2005-07-01), Cancedda
patent: 7035841 (2006-04-01), Chidlovskii
patent: 7266554 (2007-09-01), Kayahara et al.
patent: 7370034 (2008-05-01), Franciosa et al.
patent: 7493322 (2009-02-01), Franciosa et al.
patent: 2002/0069222 (2002-06-01), McNeely
patent: 2004/0167881 (2004-08-01), Masuda
patent: 2004/0261016 (2004-12-01), Glass et al.
patent: 2005/0086205 (2005-04-01), Franciosa
patent: 2005/0086224 (2005-04-01), Franciosa et al.
patent: 2008/0162455 (2008-07-01), Daga et al.
patent: 2008/0243837 (2008-10-01), Davis et al.
“Query-Free News Search”—Henzinger et al.—International World Wide Web Conference—Budapest, Hungary—Association for Computing Machinary—ACM, May 20-24, 2003 (pp. 1-10).
“Keyword-Based Document Clustering” Seung-Shik Kang—Proceedings of the 6thInternational Workshop on information Retrieval with Asian Language, vol. 11, Sapporo, Japan—ACM—2003 (pp. 132-137).
“The Detection of Duplicates in Document Image Databases” Doermann et al.—PSU.edu—Feb. 1997 (pp. 1-39).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Optimizing the performance of duplicate identification by... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Optimizing the performance of duplicate identification by..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Optimizing the performance of duplicate identification by... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4106046

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.