Mask generation for multi-layer image decomposition

Image analysis – Image segmentation – Distinguishing text from other regions

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S173000, C382S199000, C382S164000, C382S225000, C382S240000, C382S283000, C358S462000, C358S464000

Reexamination Certificate

active

06633670

ABSTRACT:

BACKGROUND OF THE INVENTION
The present invention relates to layered decomposition of images arid, more particularly, to the creation of a mask for layers of the digitally decomposed image.
The large size of digital data files required to represent images makes data compression an imperative when storing or transmitting images. On the other hand, compression can be problematic because many images comprise a combination of text, line-art graphics elements, and photographic elements and compression processes are commonly designed to be more effective with one type of image element than another. For example, the JPEG (Joint Photographic Experts Group) standard (ISO 10918) is designed to effectively compress the complex multi-color matrix of photographic elements. Annoying artifacts can appear in decompressed images, especially in the vicinity of sharp transitions which are common characteristics of graphical and textual elements. On the other hand, the compression process of the JBIG (Joint Bilevel Image Group) standard (ISO/IEC 11544:1993) utilizes arithmetic encoding and is particularly effective in compressing text and graphics but less effective in compressing natural photographic elements.
One method for improving the efficiency and results of image compression decomposes compound images into layers containing a type or types of elements that are effectively compressed using a single process. The data of each layer is then compressed with a process that is particularly effective with the type of data contained in the layer. The DRAFT ITU-T RECOMMENDATION T.44 “MIXED RASTER CONTENT (MRC),” International Telecommunication Union (ITU), Telecommunication Standardization Sector, October 1997, incorporated herein by reference, specifies the technical features of an imaging format based on segmentation of images or pages into multiple layers (planes) according to the type of image element and the application of encoding, spatial and color resolution processing specific to the type of image element comprising the layer. The ITU recommendation models a page or image as three layers; a background layer containing contone color(continuous tone and palletized color) element; a foreground layer containing text and line-art graphics, and a bi-level mask layer interposed between the background and foreground layers. The mask is used to select the layer (background or foreground) from which a pixel will be rendered in the recomposed image. The pixels of the mask layer act as a bi-level switch to select a spatially corresponding, pixel in the layer immediately above or below the mask layer. For example, if an exemplary mask layer pixel has a value of “1,” a spatially corresponding pixel might be selected from the background layer for rendering in the final image. However, if the mask layer pixel has. a value of “0” the corresponding pixel would be selected from the foreground layer. While the ITU recommendation provides for processing, interchange, and archiving images in multiple layers, it does not provide a method of generating a mask layer to facilitate layered decomposition of an image.
L. Bottou et al. describe a mask generation method in HIGH QUALITY DOCUMENT COMPRESSION WITH “DiVu”, JOURNAL OF ELECTRONIC IMAGING, Vol. 7, pp 410-425, 1998. An image is partitioned into square blocks of pixels of different sizes. Two dominant colors are identified for the pixels of each block. Cluster initialization is inherited from the previous, lower resolution (larger) block size. The pixels of each block are sorted into clusters according to the. closeness of their individual colors to one of the dominant colors of the block. An iterative, k-means algorithm is-used to sort the pixels for clustering. The iterative nature of the process increases the computational resources and the processing time required for mask creation.
D. Huttenlocher et al. describe a decomposition process in DIGIPAPER: A VERSATILE COLOR DOCUMENT IMAGE REPRESENTATION, Proceedings of the IEEE, International Conference on Image Processing, Kobe, Japan, October 24-25, 1999. The process utilizes token compression where a binary image is represented using a dictionary of token shapes and position information indicating where the token is to be drawn in the image. Segmentation of the image relies on attributes of text including the token representation of text as objects. As a result, the method is more effective with text than graphics.
What is desired, therefore, is method of layered image decomposition that is resource and time conservative and equally effective when decomposing a page or image into its text, graphical, and photographic elements.
SUMMARY OF THE INVENTION
The present invention overcomes the aforementioned drawbacks of the prior art by providing a method of generating a mask for a layered image decomposition comprising the steps of partitioning the image as a plurality of first sub-images and as a plurality of second sub-images dimensionally differing from the first sub-images, both the first and the second sub-images comprising pluralities of pixels; assigning a first sub-image mask value to an evaluation pixel according to a relationship of the luminance of the evaluation pixel and a sub-image luminance of the first sub-image; assigning a second sub-image mask value to the evaluation pixel according to a relationship of the luminance of the evaluation pixel and a sub-image luminance of the second sub-image; and setting a mask value for the evaluation pixel as a function of the first and second sub-image mask values for a plurality of pixels of the first and the second sub-images. The method is non-iterative and conserves computational resources and time which is important for on-line operations. Further, the method is equally effective for text, line art graphic, and photographic image elements.


REFERENCES:
patent: 5341226 (1994-08-01), Shiau
patent: 5432870 (1995-07-01), Schwartz
patent: 5539842 (1996-07-01), Schwartz
patent: 5745596 (1998-04-01), Jefferson
patent: 5778092 (1998-07-01), MacLeod
patent: 5802203 (1998-09-01), Black et al.
patent: 5956468 (1999-09-01), Ancin
patent: 6360022 (2002-03-01), Lubin et al.
patent: 6373981 (2002-04-01), de Queiroz et al.
Huang, et al “Segmentation-based hybrid coding using luminance masking”, IEEE, pp. 750-751, 1998.*
Aiyer et al., “A Fast, Table-Lookup Algorithm for Classifying Document Images” IEEE/ICIP Kobe Japan Proceedings, Oct. 1999.
Mukherjee et al., “Document Page Segmentation Using Multiscale Clustering” IEEE/ICIP Kobe Japan Proceedings, Oct. 1999.
Said et al., “Simplified Segmentation for Compound Image Compression” IEEE/ICIP Kobe Japan Proceedings, Oct. 1999.
Li et al., “Text and Picture Segmentation by the Distribution Analysis of Wavelet Coefficients” IEEE/ICIP Chicago, IL Proceedings, Oct. 1999.
Li et al., “Context Based Multiscale Classification of Images” IEEE/ICIP Chicago, IL Proceedings, Oct. 1999.
Lu, “Detection of Text Regions from Digital Engineering Drawings” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, No. 4, Apr. 1998.
Etemad et al., “Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration,” IEEE Transactions on Pattern Analysis and Machine Intelligenc, vol. 19, No. 1, Jan. 1997.
Savakis, Adaptive Document Image Thresholding Using Foreground and Background Clustering, IEEE/ICIP Chicago, IL Proceedings, Oct. 1999.
Simon et al., “A Fast Algorithm for Bottom-Up Document Layout Analysis” IEEE Transactions on Pattern Analysis and Machine Intelligenc, vol. 19, No. 3, Mar. 1997.
Draft ITU-T Recommendation T.44 “Mixed Raster Content (MRC)”, International Telecommunication Union, Study Group 8 (Contribution (10/97).
Bottou et al., “High Quality Document Image Compression with DjVu” Jul. 13, 1998.
Haffner et al., Color Documents on the Web with DjVu IEEE/ICIP Kobe Japan Proceedings, Oct. 1999.
Huttenlocher et al., “Digipaper: A Versatile Color Document Image Representation” IEEE/ICIP Kobe Japan Proceedings, Oct. 1999.
Queiroz, “Compression of Compound Documents” IEEE/ICIP Kobe Japan Proceedings, O

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Mask generation for multi-layer image decomposition does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Mask generation for multi-layer image decomposition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Mask generation for multi-layer image decomposition will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3152123

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.