Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-03-28
2009-11-10
Corrielus, Jean M (Department: 2162)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000, C707S793000
Reexamination Certificate
active
07617195
ABSTRACT:
In accordance with the disclosure, there is provided a method for identifying duplicate documents comprising drafting a first document and creating a near unique representative string based on the document content. The method further comprises searching for other documents with the same NRS and selectively assigning a duplicate group identification to the first document, the duplicate group identification is unique if no near unique representative string matches are found, or the duplicate group identification is the same as an associated duplicate document's duplicate group identification that matches the NRS. The method further comprises placing the DGI into a meta-data of the first document and recalling a list of duplicates of a particular document based upon user demand by searching the meta-data and selecting documents using the same DGI.
REFERENCES:
patent: 5486686 (1996-01-01), Zdybel, Jr. et al.
patent: 5629980 (1997-05-01), Stefik
patent: 5634012 (1997-05-01), Stefik
patent: 5638443 (1997-06-01), Stefik
patent: 5715403 (1998-02-01), Stefik
patent: 5893908 (1999-04-01), Cullen
patent: 6041323 (2000-03-01), Kubota
patent: 6236971 (2001-05-01), Stefik
patent: 6615209 (2003-09-01), Gomes
patent: 6658423 (2003-12-01), Pugh
patent: 6792576 (2004-09-01), Chidlovskii
patent: 6917936 (2005-07-01), Cancedda
patent: 7035841 (2006-04-01), Chidlovskii
patent: 7266554 (2007-09-01), Kayahara et al.
patent: 7370034 (2008-05-01), Franciosa et al.
patent: 7493322 (2009-02-01), Franciosa et al.
patent: 2002/0069222 (2002-06-01), McNeely
patent: 2004/0167881 (2004-08-01), Masuda
patent: 2004/0261016 (2004-12-01), Glass et al.
patent: 2005/0086205 (2005-04-01), Franciosa
patent: 2005/0086224 (2005-04-01), Franciosa et al.
patent: 2008/0162455 (2008-07-01), Daga et al.
patent: 2008/0243837 (2008-10-01), Davis et al.
“Query-Free News Search”—Henzinger et al.—International World Wide Web Conference—Budapest, Hungary—Association for Computing Machinary—ACM, May 20-24, 2003 (pp. 1-10).
“Keyword-Based Document Clustering” Seung-Shik Kang—Proceedings of the 6thInternational Workshop on information Retrieval with Asian Language, vol. 11, Sapporo, Japan—ACM—2003 (pp. 132-137).
“The Detection of Duplicates in Document Image Databases” Doermann et al.—PSU.edu—Feb. 1997 (pp. 1-39).
Gastaldo Michel
Liang Tao
Monet Nicolas
Ragnet Francois
Zhu Xianing
Corrielus Jean M
Fay Sharpe LLP
Hauber Karl W.
Ly Anh
Xerox Corporation
LandOfFree
Optimizing the performance of duplicate identification by... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Optimizing the performance of duplicate identification by..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Optimizing the performance of duplicate identification by... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4106046