Automated layer extraction and pixel assignment from image...

Image analysis – Image segmentation

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S236000, C382S107000

Reexamination Certificate

active

06668080

ABSTRACT:

FIELD OF THE INVENTION
This invention relates generally to three-dimensional (3D) scene modeling in which the scene is made up from a number of images, and more particularly to automated layer extraction from the images making up such a scene, and automated pixel assignment of the pixels of the images to a particular layer of the layers that have been extracted.
BACKGROUND OF THE INVENTION
One type of graphics application for computers and other such devices is three-dimensional (3D) scene modeling. Generally, in 3D scene modeling, a 3D scene is modeled from a sequence of two-dimensional (2D) images taken of the scene by cameras placed at various locations around and/or within the scene. This sequence of 2D images allows for the creation of a 3D geometric model of the scene, including in some instances what is known as a texture map that captures the visual appearance of the scene. The texture map is a 2D bitmapped image of the texture of a surface of the 3D scene, such as a uniform texture (e.g., a brick wall), or an irregular texture (e.g., such as wood grain or marble). The texture map is then “wrapped around” geometric objects within the 3D scene.
In another approach, the sequence of 2D images provides for the creation of the 3D scene by decomposing the scene into a collection of 3D layers, or sprites. Each 3D layer includes a plane equation, a color image that captures the appearance of the sprite, a per-pixel opacity map, and a per-pixel depth-offset relative to the nominal plane of the layer. A generative model for this approach—that is, constructing the 3D layers of the 3D scene from the sequence of 2D images—is described in the reference S. Baker, R. Szeliski, and P. Anadan, A layered approach to stereo reconstruction, in IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '98), pages 434-441, Santa Barbara, June 1998.
However, the approach of decomposing a 3D scene into 3D layers has disadvantages as the approach is known in the prior art, including as described in the reference of the previous paragraph. First, recovery of the 3D layers from the sequence of 2D images for constructing a model of a 3D scene is accomplished by manual input. Among other things, this means that a user must painstakingly divide a scene into 3D layers, based on at least large part on subjective criteria that may be different to each user. Thus, the decomposition approach is not well suited to automatic rendering of 3D scenes as is the case with texture mapping approaches, and therefore may not be selected as the utilized approach, even if it were to yield better results than other approaches.
Second, assigning pixels of the 2D images to the extracted 3D layers is known to be required, but a complete methodology as to how to assign pixels to layers as a general matter is not known. This means that pixels of images are assigned to layers on an ad hoc basis, as opposed to following a formal methodology, or, better yet, having an automated manner of pixel assignment to layers. This disadvantage of the decomposition approach also militates against the use of the approach to automatic rendering of 3D scenes, such that other approaches, such as texture mapping approaches, may instead be chosen for rendering, even if these alternative approaches yield less optimal results.
For these and other reasons, then, there is a need for the present invention.
SUMMARY OF THE INVENTION
The invention relates to automated layer extraction from a number of 2D images that are formed by different cameras viewing a 3D scene from different viewpoints, and the automated assignment of pixels of the images to the extracted layers. As used herein, layers are also referred to as sprites. In one embodiment, a computer-implemented method is operable on a number of 2D images of such a 3D scene, where each 2D image has a number of pixels that correspond to the pixels of the other images. The method determines a number of planes of the scene, and assigns pixels of the images to one of the planes. At least the planes of the scene are then output.
In one embodiment, the method determines the number of layers via a statistical estimation approach that embodies notions of physical coherence of surfaces and objects. These include: (i) that the pixels belonging to the same layer should approximately form a planar region in 3D space (i.e., their combined offsets relative to a plane should be small; (ii) that nearby pixels in an image are likely to belong to the same layer; and, (iii) the image appearance of the different portions of a layer should be similar. The method of this particular embodiment uses Bayesian reasoning techniques, as known within the art, and in so doing embodies the notions of physical coherence in terms of Bayesian “prior probabilities” regarding the physical description of the scene, and the evidence provided by the images as “likelihoods” associated with the specific layered decomposition of the scene. The “posterior probabilities” associated with different possible layer decompositions (i.e., the number of layers where the number is between one and some predefined maximum possible value n, and the associated pixel assignments) are evaluated and the most likely decomposition as determined by an estimation algorithm is chosen.
The number of planes can be first determined by using the high-entropy pixels of the images. These are pixels which have distinct image appearance such as comers of regions or highly textured points (as opposed to, for example, areas which are homogeneous in color). Also in one particular embodiment, the method assigns all pixels of the images, other than the high-entropy pixels, to the planes via an iterative Expectation Maximization-type approach based on Bayesian decision criteria.
Embodiments of the invention provide for advantages not offered by the prior art. Foremost, embodiments of the invention provide for an automated manner by which 3D layers are extracted from the 2D images making up the scene, and for an automated manner by which pixels of the images are assigned to these extracted layers. This allows for the layer decomposition approach to 3D scene modeling to be automated, such that it becomes a more attractive approach to such modeling as compared to other approaches that are already automated, such as texture mapping approaches.


REFERENCES:
patent: 5557684 (1996-09-01), Wang et al.
patent: 5706417 (1998-01-01), Adelson
Wang et al. “Representing Moving Images with Layers”. IEEE Transactions on Image Processing. V. 3, Issue 6. Sep. 1994, p. 625-638.*
Elias et al. “The Recovery of a Near Optimal Layer Representation for an Entire Image Sequence”. International Conference on Image Processing. V. 1. Oct. 1997. p. 735-738.*
Chang et al. “Simultaneous Motion Estimation and Segmentation”. IEEE Transactions on Image Processing. V. 6, Issue 9. Sep. 1997. p. 1326-1333.*
Ayer et al. “Layered Representation of Motion Video using Robust Maximum-Likelihood Estimation of Mixture Models and MDL Encoding”. Fifth International Conference on Computer Vision. Jun. 1995. p. 777-784.*
Borshukov et al. “Motion Segmentation by Multistage Affine Classification”. IEEE Transactions on Image Processing. V. 6, Issu 11. Nov. 1997. p. 1591-1594.*
R. Szeliski, A multi-view approach to motion and stereo. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'99), Fort Collins, Jun. 1999.
J.Y.A. Wang and E.H. Adelson, Layer representation for motion analysis. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '93), pp. 361-366, New York NY, Jun. 1993.
S. Baker, R. Szeliski, P. Anandan, A layered approach to stereo reconstruction. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'98), pp. 434-441, Santa Barbara, Jun. 1998.
J. Bergen, P. Anandan, et al., Hierarchical Model-Based Motion Estimation, Proc. 2nd European Conference on Computer Vision, pp 237-252, 1992.
Martin A. Fischler and Robert C. Bolles, Random Sample Consensus: A Paradigm for Model Fitting

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Automated layer extraction and pixel assignment from image... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Automated layer extraction and pixel assignment from image..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automated layer extraction and pixel assignment from image... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3171680

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.