Enhanced compression of documents

Image analysis – Image compression or coding – Quantization

Reexamination Certificate

Details

C382S250000

Reexamination Certificate

active

06606418

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention generally relates to compression of data representing graphical images for transmission and/or storage and, more particularly, to extreme compression of digital images of documents.
2. Description of the Prior Art
Pictorial and graphics images contain extremely large amounts of data and, if digitized to allow transmission or processing by digital data processors, often require many millions of bytes to represent respective pixels of the image or graphics with good fidelity. The purpose of image compression is to represent images with less data in order to save storage costs or transmission time and costs. The most effective compression is achieved by approximating the original image, rather than reproducing it exactly. The JPEG (Joint Photographic Experts Group) standard, discussed in detail in “JPEG Still Image Data Compression Standard” by Pennebaker and Mitchell, published by Van Nostrand Reinhold, 1993, which is hereby fully incorporated by reference, allows the interchange of images between diverse applications and opens up the capability to provide digital continuous-tone color images in multi-media applications.
JPEG is primarily concerned with images that have two spatial dimensions, contain gray scale or color information, and possess no temporal dependence, as distinguished from the MPEG (Moving Picture Experts Group) standard. JPEG compression can reduce the storage requirements by more than an order of magnitude and improve system response time in the process. A primary goal of the JPEG standard is to provide the maximum image fidelity for a given volume of data and/or available transmission or processing time, and any arbitrary degree of data compression is accommodated. It is often the case that data compression by a factor of twenty or more (and reduction of transmission time and storage size by a comparable factor) will not produce artifacts which are noticeable to the average viewer.
Of course, other data compression techniques are possible and may produce greater degrees of image compression for certain classes of images or graphics having certain known characteristics. The JPEG standard has been fully generalized to perform substantially equally regardless of image content and to accommodate a wide variety of data compression demands. Therefore, encoders and decoders employing the JPEG standard in one or more of several versions have come into relatively widespread use and allow wide access to images for a wide variety of purposes. Standardization has also allowed reduction of costs, particularly of decoders, to permit high quality image access to be widely available. Therefore, utilization of the JPEG standard is generally preferable to other data compression techniques even though some marginal increase of efficiency might be obtained thereby, especially for particular and well-defined classes of images.
Even though such large reductions in data volume are possible, particularly using techniques in accordance with the JPEG standard, some applications require severe trade-offs between image quality and costs of data storage or transmission time. For example, there may be a need to store an image for a period of time which is a significant fraction of the useful lifetime of the storage medium or device as well as requiring a significant amount of its storage capacity. Therefore, the cost of storing an image for a given period of time can be considered as a fraction of the cost of the storage medium or device and supporting data processor installation, notwithstanding the fact that the image data could potentially be overwritten an arbitrarily large number of times. The cost of such storage is, of course, multiplied by the number of images which must be stored.
Another way to determine the storage cost versus image quality trade-off is to determine the maximum cost in storage that is acceptable and then determine, for a given amount of quality, how long the desired number of images can be saved in the available storage. This is a function of the compressed size of the images which generally relates directly to the complexity of the images and inversely with the desired reconstruction quality.
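The trade-off described above can be illustrated with a minimal sketch. It assumes a simple rolling-archive model that is not taken from the source: a fixed storage budget, a constant daily volume of new images, and a uniform compressed size per image. The function name retention_days and all of the figures are hypothetical and chosen only for illustration.

```python
def retention_days(storage_bytes, images_per_day, bytes_per_image):
    """Days of images that fit in a fixed storage budget (hypothetical model)."""
    if images_per_day <= 0 or bytes_per_image <= 0:
        raise ValueError("images_per_day and bytes_per_image must be positive")
    # Bytes consumed by one day's worth of newly stored images.
    daily_bytes = images_per_day * bytes_per_image
    return storage_bytes / daily_bytes


# Hypothetical figures, not from the source: 1 TB budget, 10 million images a day.
budget = 10**12
print(retention_days(budget, 10_000_000, 20_000))  # 20 kB per compressed image
print(retention_days(budget, 10_000_000, 10_000))  # 10 kB per compressed image
```

Under these assumptions, halving the compressed size of each image doubles the period for which the same number of images per day can be retained in the same storage.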
An example of such a demanding application is the storage of legal documents which must be stored for an extended period of time, if not archivally, especially negotiable instruments such as personal checks which are generated in large numbers amounting to tens of millions daily. While the initial clearing of personal checks and transfer of funds is currently performed using automated equipment and is facilitated by the use of machine readable indicia printed on the check, errors remain possible and it may be necessary to document a particular transaction for correction of an error long after the transaction of which the check formed a part.
As a practical matter, the needed quality of the image data also changes over time in such an application. For example, within a few months of the date of the document or its processing, questions of authenticity often arise, requiring image quality sufficient to, for example, authenticate a signature, while at a much later date, it may only be necessary for the image quality to be sufficient to confirm basic information about the content of the document. Therefore, the image data may be additionally compressed for longer term storage when reduced image quality becomes more tolerable, particularly in comparison with the costs of storage. At the present time, personal check images are immediately stored for archival purposes on write-once CD-ROM or other non-modifiable media and saved, for legal reasons, for seven years. The same data is available for only a few months in on-line, rapid-access storage.
Personal checks, in particular, present some image data compression complexities. For example, to guard against fraudulent transactions, a background pattern of greater or lesser complexity and having a range of image values is invariably provided. Some information will be printed in a highly contrasting ink, possibly of multiple colors, while other security information will be included at relatively low contrast. Decorations including a wide range of image values may be included. Additionally, hand-written or printed indicia (e.g. check amounts and signature) will be provided with image values which are not readily predictable.
Even much simpler documents may include a variety of image values such as color and shadings in letterhead, high contrast print, a watermark on the paper and a plurality of signatures. This range of image values that may be included in a document may limit the degree to which image data may be compressed when accurate image reconstruction is necessary. Therefore, the cost of storage in a form from which image reconstruction is possible with high fidelity to the original document is relatively large, and such costs limit the period for which such storage is economically feasible, regardless of the desirability of maintaining such storage and the possibility of rapid electronic access for longer periods.
Since such image values must be accurately reproducible and utilization of the JPEG standard is desirable in order to accommodate widespread access and system intercompatibility, substantially the only technique for further reduction of data volume consistent with reproduction with good image fidelity is to reduce the spatial frequency of sampling of the original image. However, sampling inevitably produces aliasing and reduces legibility of small indicia, especially at low contrast. Currently, sampling at 100 dots or pixels per inch (a reduction to about one-third to one-sixth of the 300 dpi or 600 dpi resolutions of printers currently in common use) is considered to be the limit for adequate legibility of low-contrast indicia on personal checks. The American National Standards Institute (ANSI) standards committee for image interchange recommends 100 dpi as a minimum resolution. Most check applications use either 100 dpi or 120 dp
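The effect of reduced sampling resolution on raw data volume can be sketched as follows. The sketch assumes a document measuring 6 inches by 2.75 inches (a common personal check size in the United States, which the source does not state) and counts one sample per pixel; the function name pixel_count is hypothetical.

```python
def pixel_count(width_in, height_in, dpi):
    """Number of pixels when a width_in x height_in inch page is sampled at dpi."""
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    # Samples along each axis, rounded to whole pixels.
    return round(width_in * dpi) * round(height_in * dpi)


# Assumed document size (not from the source): 6 in x 2.75 in.
for dpi in (600, 300, 120, 100):
    print(dpi, pixel_count(6, 2.75, dpi))
```

Because the sample count scales with the square of the linear resolution, sampling at 100 dpi rather than 300 dpi reduces the number of pixels by a factor of nine before any JPEG coding is applied, and by a factor of thirty-six relative to 600 dpi.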
