Shape information coding and decoding apparatus for...

Image analysis – Image compression or coding – Shape – icon – or feature-based compression

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S239000, C375S240080

Reexamination Certificate

active

06553149

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to video coding and decoding and more particularly to a shape information coding and decoding apparatus for adaptively bordering and method therefor, wherein bordering is performed with respect to blocks of shape information when forming contexts in order to context-based arithmetic encode a shape in a picture.
2. Description of Related Art
There is a case that only a certain particular object in a frame is intended to be processed to increase coding efficiency or improve picture quality in processing video information. In this case, shape information of the particular object is required to separate the particular object from a background.
A block of a particular object's shape information includes pixels for the particular object, the pixels having a specified value (for example, “1”), and pixels for others except the object, the pixels having a specified value “0”. The shape information of the particular object is divided into blocks of a predetermined size (for example, 16×16 and 8×8) and a specified operation is performed with respect to ambient pixels of the pixels to be coded in order to code the shape information within a block.
FIG. 1
is a block diagram of a conventional object-based video coder.
Concepts of a shape coder and a video object planes (VOP) are introduced here. The VOP indicates an object at a certain point in a time domain of a content having a predetermined shape which can be accessed and edited by a user. The information should be coded by each VOP for support of a content-based functionality.
Primarily, signals of a picture are classified into shape information and texture information and two types of information are respectively inputted to a shape coding unit
11
and motion estimation unit
12
.
The shape coding unit
11
performs lossy coding or lossless coding with respect to the shape information of a relevant frame. Reconstructed shape information is inputted to both motion compensation unit
13
and texture coding unit
17
. Both the motion compensation unit
13
and texture coding unit
17
operate based upon an object. A shape information bit stream, which is another output of the shape coding unit
11
, is inputted to a multiplexer
18
.
The motion estimation unit
12
estimates motion information of a current frame's texture information using input texture information of the current frame and texture information of a previous frame which is stored in a previous reconstructed frame memory
14
. The estimated motion information is inputted to the motion compensation unit
13
while a motion information bit stream is encoded and inputted to the multiplexer
18
. The motion compensation unit
13
performs motion compensation using the motion information obtained through the motion estimation unit
12
and the previous reconstructed frame received from the previous reconstructed frame memory
14
.
The texture coding unit
17
codes a prediction error. The prediction error is a difference between input texture information obtained through a subtracter
15
and motion compensated texture information obtained through the motion compensation unit
13
. A texture bit stream which is generated through the coding at the texture coding unit
17
is inputted to the multiplexer
18
and an error signal of reconstructed texture information is inputted to an adder
16
. The previous reconstructed frame memory
14
stores a previous reconstructed frame signal received from the adder
16
. The previous reconstructed frame signal is obtained by adding the error signal of the reconstructed texture information to the motion compensated signal.
Digital video may be classified into progressive video and interlaced video according to frame constructing methods. For the progressive video, a frame is constructed in such a manner that lines consecutively progresses from the top to the bottom. For the interlaced video, a frame is constructed in such a manner that a field of odd lines is primarily constructed and then even lines in the other field are interlaced with the odd lines of the first field. A height (the number of lines) of the field is a half of the height of the frame. This is illustrated in
FIGS. 2
a
and
2
b.
FIG. 2
a
shows a frame of the progressive video and
FIG. 2
b
shows two fields -a top field and a bottom field- and a frame of the interlaced video. In
FIGS. 2
a
and
2
b,
the top and bottom fields consist of lines (solid arrows in the top field and dotted arrows in the bottom field) and the lines of each field interlace (the solid arrows are interlaced with the dotted arrows) to construct an interlaced frame.
When the top and bottom fields are constructed, as shown in
FIG. 2
b,
there is a time gap between the two fields and the top field precedes the bottom field. In other cases, the bottom field may precede the top field. For the lines forming a frame in an interlaced video, the lines constructing the top field and the lines constructing the bottom field are separately scanned by each field. Because of the time gap between the top field and the bottom field, signal characteristics of neighboring lines in the interlaced frame can be different.
Particularly, in case of a picture having a lot of motion, this feature described above is prominent. When applying video coding functions developed in accordance with properties of the progressive video, such as motion estimation, motion compensation, and discrete cosine transform (DCT), to the coding of the interlaced video, reduction of coding efficiency is caused. Technology, such as field-based motion estimation and compensation and adaptive frame/field DCT, has been developed to overcome this problem. Such technology is disclosed in the standard MPEG-2 established by the ISO/IEC JTC1/SC29/WG11 for applications of digital TV and the like. The technology has been frequently applied to actual application products.
FIGS. 3
a
and
3
b
show interlaced shape information where an object has no motion or a little motion between two fields. As shown in
FIG. 3
a,
correlation between lines in a frame is higher compared with that in each field, so it is better to code the shape information from a frame than from each field in this case.
FIGS. 4
a
and
4
b
show shape information where an object has much motion between two fields. As shown in
FIG. 4
a
where the lines are grouped into each field, variation between shape information of each line is little and correlation between lines is high in the same field. However, as shown in
FIG. 4
a,
when considering a whole frame, the variation between shape information of each line is larger, so the correlation between lines is lower. Therefore, coding efficiency is reduced when coding the shape information from the frame.
It is best to adaptively select one between a field coding mode and a frame coding mode rather than to use only one mode when coding the interlace shape information.
FIG. 5
a
shows a context for performing context-based arithmetic encoding (CAE) in an INTRA mode. A value of a pixel
51
is encoded through a specified operation using pixels C
0
to C
9
neighboring with the pixel
51
to be encoded.
FIG. 5
b
shows a context for performing the CAE in an INTER mode. A value of a pixel
52
is encoded through a specified operation using pixels C
0
to C
3
neighboring with the pixel
52
to be encoded in a current block and a pixel C
6
corresponding to the pixel
52
and its neighboring pixels C
4
, C
5
, C
7
, and C
8
in a previous frame.
When coding shape information of a particular object, the information is divided into binary alpha blocks (BABs) of a predetermined size, for example, 16×16. In this case, bordering is performed to construct contexts of outer pixels of a BAB. As shown in
FIGS. 5
a
and
6
, if the pixel
51
to be encoded is located in the left border of a BAB
61
, values of the pixels C
0
, C
1
, C
5
, C
6
, and C
9
in
FIG. 5
a
cannot be acknowledged, so the BAB
61
is bordered by a left border
63
an

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Shape information coding and decoding apparatus for... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Shape information coding and decoding apparatus for..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Shape information coding and decoding apparatus for... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3029064

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.