Video encoding and decoding method

Pulse or digital communications – Bandwidth reduction or expansion – Television or motion video signal

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C375S240290

Reexamination Certificate

active

06381276

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to a method for encoding a sequence of frames by means of a three-dimensional (3D) subband decomposition involving a spatial and temporal filtering step applied to the 3D spatio-temporal data which correspond to said sequence of frames considered as a 3D volume, said decomposition being applied to successive groups of frames only after the implementation, on said frames, of motion estimation and compensation operations in the low temporal subbands at each level of the temporal decomposition, the compensation operation leading, among the filtering pixels, to double-connected pixels corresponding to regions becoming uncovered during motion. The invention also relates to a method for decoding signals previously coded by means of such an encoding method.
BACKGROUND OF THE INVENTION
Video compression schemes operate by removing redundant information from the signals before their transmission and then reconstructing, at the decoder side, an approximation of the image from the remaining compressed information. The temporal redundancy corresponds to the fact that pixel values are not independent but are correlated with their neighbours across successive frames. The reduction of this temporal redundancy is mainly achieved by two types of approaches: the “hybrid” or predictive approach, according to which a prediction of the current frame is computed based on the previously transmitted frames and only the prediction error, i.e. the difference between the current frame and its prediction, is intra-coded and transmitted, and the three-dimensional (3D, or 2D+t) approach in which the temporal redundancy is exploited by means of a temporal transform, in a similar way to the spatial techniques for removing the spatial redundancy (which corresponds to the fact that pixel values are not independent within the same frame).
In this 3D approach, the sequence of frames is processed as a 3D volume and thus the classical subband decomposition used in image coding can be extended to 3D spatio-temporal data by using separable transforms, for example wavelet or wavelet packets transforms implemented by means of filter banks. There is an anisotropy in such a 3D structure, but it can be taken into account by using different filter banks in the spatial and temporal directions (usually, Haar filters are used for temporal filtering since the added delay of using longer filters is undesirable ; furthermore, Haar filters are the only perfect reconstruction orthogonal filters which do not present the boundaries effect, since they are two-tap filters).
The coding efficiency of such a 3D coding scheme can be improved by performing motion estimation/compensation in the low temporal subbands at each level of the temporal decomposition. It was shown, for instance in “Three-dimensional subband coding with motion compensation”, by J. R. Ohm, IEEE Transactions on Image Processing, vol. 3, No5, September 1994, pp.559-571, that motion-compensated 3D subband coding leads to better visual and peak signal-to-noise ratio (PSNR) results than classical predictive schemes, for a given bitrate (in said document, a 3D subband decomposition with full motion compensation is performed using Haar filters in the temporal domain and a 2D discrete wavelet transform—or DWT—in the spatial domain).
However, the motion compensation raises the problem of points which are filtered twice or not filtered at all, as illustrated in FIG.
1
. In the left part of
FIG. 1
, A and B respectively designate previous and current frames, and (a
1
to a
6
), (b
1
to b
6
) are pixels of said frames respectively (BBY=block boundary). The right part of
FIG. 1
illustrates the process of motion compensation temporal filtering combined with block matching. As the connected pixels are filtered along the motion trajectory, and as matched blocks overlap with neighbouring blocks (except in case of no motion or translational motion), double-connected and unconnected pixels generally appear.
The strategy adopted in said document to cope with unconnected pixels is the following: for an unconnected pixel in the current frame (like b
1
), the original pixel value is inserted into the temporal low (L) subband. An unconnected pixel in the previous frame (like a
3
and a4) is related to the same motion vector as its neighbours. The result of the high-pass filtering in the motion direction is put into the temporal high (H) subband at the unconnected pixel location (in the case of Haar filters, this represents the displaced frame difference). It is also proposed to perform the motion compensated prediction for the unconnected pixels in the previous frame by using the reconstructed frame just previous to it. Temporal subbands resulting from filtering replace the original frames: the temporal low (L) subband takes the place of the current frame, and the temporal high (H) subband replaces the reference frame.
Another method to cope with unconnected pixels is described in the document “Motion-compensated 3-D subband coding of video”, by S. J. Choi and al, IEEE Transactions on Image Processing, vol. 8, no2, February 1999, pp.155-167, with reference to FIG.
2
. For an unconnected pixel in the previous frame (like a
3
and a4 ), the original value is inserted into the temporal low subband, which gives better visual results than the strategy defined in the first cited document. For an unconnected pixel in the current frame (like b
1
), a DFD (displaced frame difference) value is taken. By making the direction of motion estimation agree with that of motion compensation, the best available motion vector can be used for this DFD value, without the need of an additional motion estimation. The temporal subbands are relocated differently, as shown in the right part of FIG.
2
: temporal low (L) subband replaces the reference frame, while temporal high (H) subband replaces the current frame. This is better suited in the case of unconnected pixels than the strategy of the first cited document, since the high-frequency subbands have smaller energy and are compatible with a DFD value for the unconnected pixels.
To sum up this approach, the following relations (1) to (3) are used:
H
[
m
,
n
)
=
1
2

(
B

[
m
,
n
]
-
A
~

[
m
-
d
m
,
n
-
d
n
]
)
,


(
1
)
L

[
m
-
d
_
m
,
n
-
d
_
n
]
=


1
2



(
B
~

[
m
-
d
_
m
+
d
m
]
+


A

[
m
-
d
_
m
,
n
-
d
_
n
]
)



for



connecting



pixels
,


(
2
)
and





L

[
m
,
n
]
=
2
·
A

[
m
,
n
]



for



unconnected



pixels
,
(
3
)
where (m,n) designates the position of the pixels, H[m,n], L[m,n] are temporal high- and low-subbands, A[m,n], B[m,n] are the previous and current frames, [d
m
, d
n
] is for a pixel the motion vector estimated between the two considered consecutive frames A and B, {overscore (d)}
m
, {overscore (d)}
n
are the nearest integers to d
m
, d
n
, and , {tilde over (B)} are interpolated values if motion pixels are subpixel accurate. In case of multiple connected pixels, A is calculated with the first pair (A, B) encountered.
In fact, the problem of unconnected and double connected pixels appears as closely related to that of covered/uncovered areas during motion. Indeed, when two objects having a common part in the frame at time T become separate at time T+1, two regions in the current frame will correspond, by motion compensation, to the same region in the reference frame: this is the uncovered area, that appears as double connected in the case of the previous analysis giving the relations (1) to (3). A possible situation is illustrated in
FIG. 3
, where FA designates a foreground area, belonging to a foreground object
1
, and UR the uncovered area of a background object
2
when said regions become distinct (i.e. they do not overlap). The approach described in the second document associates to these p

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Video encoding and decoding method does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Video encoding and decoding method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Video encoding and decoding method will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2861760

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.