Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-08-09
Cheung, Mary (Department: 3621)
C382S168000, C382S173000, C382S181000, C382S232000, C382S276000, C345S427000
active
06928435
ABSTRACT:
An apparatus and method for determining if a query document matches one or more of a plurality of documents in a database. In a coarse matching stage, a compressed file or other query document is scanned to produce a bit profile. Global statistics such as line spacing and text height are calculated from the bit profile and used to narrow the field of documents to be searched in an image database. The bit profile is cross-correlated with bit profiles of documents in the search space to identify candidates for a detailed matching stage. If multiple candidates are generated in the coarse matching stage, a set of endpoint features is extracted from the query document for detailed matching in the detailed matching stage. Endpoint features contain sufficient information for various levels of processing, including page skew and orientation estimation. In addition, endpoint features are stable, symmetric and easily computable from commonly used compressed files including, but not limited to, CCITT Group 4 compressed files. Endpoint features extracted in the detailed matching stage are used to correctly identify a matching document in a high percentage of cases.
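The coarse matching stage described in the abstract can be illustrated with a minimal sketch. It assumes a bit profile is simply a one-dimensional list of numbers, one per scan row, and it uses a normalized cross-correlation searched over a few row shifts. The function names, the `max_lag` and `threshold` parameters, and the sample profiles are hypothetical choices for illustration only; the patent's actual profile extraction, statistics, and scoring are not reproduced here.

```python
from math import sqrt


def normalized_cross_correlation(a, b, max_lag=5):
    """Best normalized correlation of two 1-D profiles over small row shifts.

    Hypothetical helper: returns a value in [-1, 1], or -1.0 if no shift
    produced a usable overlap.
    """
    def centered(values):
        mean = sum(values) / len(values)
        return [v - mean for v in values]

    best = -1.0
    for lag in range(-max_lag, max_lag + 1):
        # Shift one profile against the other, then keep the overlapping part.
        if lag >= 0:
            xa, xb = a[lag:], b
        else:
            xa, xb = a, b[-lag:]
        n = min(len(xa), len(xb))
        if n < 2:
            continue
        ca, cb = centered(xa[:n]), centered(xb[:n])
        denom = sqrt(sum(v * v for v in ca) * sum(v * v for v in cb))
        if denom == 0:
            continue  # a constant segment carries no shape information
        score = sum(p * q for p, q in zip(ca, cb)) / denom
        best = max(best, score)
    return best


def coarse_candidates(query_profile, database, threshold=0.9):
    """Return IDs of database profiles that correlate well with the query.

    `database` maps a document ID to its stored profile. The threshold is an
    assumed value, not one taken from the patent.
    """
    scores = {
        doc_id: normalized_cross_correlation(query_profile, profile)
        for doc_id, profile in database.items()
    }
    hits = [doc_id for doc_id, s in scores.items() if s >= threshold]
    return sorted(hits, key=lambda doc_id: -scores[doc_id])


if __name__ == "__main__":
    query = [0, 0, 5, 9, 5, 0, 0, 4, 8, 4, 0, 0]
    database = {
        "shifted_copy": [0, 5, 9, 5, 0, 0, 4, 8, 4, 0, 0, 0],
        "unrelated": list(range(1, 13)),
    }
    print(coarse_candidates(query, database))
```

If more than one ID survives this step, the abstract's detailed matching stage (endpoint features) would be used to choose among them; that stage is not sketched here.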
REFERENCES:
patent: 4292622 (1981-09-01), Henrichon, Jr.
patent: 4809081 (1989-02-01), Linehan
patent: 4985863 (1991-01-01), Fujisawa et al.
patent: 5278920 (1994-01-01), Bernzott et al.
patent: 5351310 (1994-09-01), Califano et al.
patent: 5465353 (1995-11-01), Hull et al.
patent: 5689585 (1997-11-01), Bloomberg et al.
patent: 5867597 (1999-02-01), Peairs et al.
patent: 5893095 (1999-04-01), Jain et al.
patent: 6249604 (2001-06-01), Huttenlocher et al.
patent: 6268935 (2001-07-01), Kingetsu et al.
patent: 6363381 (2002-03-01), Lee et al.
patent: 581971 (1994-02-01), None
Doermann et al., Detection of Duplicates in Document Image Databases, Aug. 24, 1998, Image and Vision Computing v16 n12-13, pp 907-920.
Hull Johnathan
Lee Dar-Shyang
Blakely, Sokoloff, Taylor & Zafman LLP
Cheung Mary
Ricoh Co. Ltd.