Coding video dissolves using predictive encoders

Image analysis – Image compression or coding – Interframe coding

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Coding video dissolves using predictive encoders Coding video dissolves using predictive encoders

: 2000-03-06
: 2004-08-03
: Boudreau, Leo (Department: 2721)
: Image analysis
: Image compression or coding
: Interframe coding

: C382S239000
: Reexamination Certificate
: active
: 06771825
: ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to video compression processing, and, in particular, to the coding of video dissolves using predictive encoders.
2. Description of the Related Art
Predictive video encoders, such as those conforming to an MPEG video compression standard, gain much of their compression capability by making predictions from other, previously coded frames. MPEG coders have three main types of frames: I, P, and B. An I frame is coded independently using intra-frame encoding techniques without reference to any other frames. A P frame is coded using inter-frame encoding techniques as the motion-compensated difference between itself and the previously-coded P or I frame. P and I frames are referred to as “anchor” frames, because they can be used as references for coding other frames. Depending on which particular MPEG B-frame encoding mode is enabled and depending on which prediction technique provides the best coding results, each macroblock in a B frame may be coded (1) using forward prediction as the difference between itself and the previous anchor frame, (2) using backward prediction as the difference between itself and the next anchor frame, (3) using interpolated or bidirectional prediction as the difference between itself and the average of the previous and next anchor frames, or (4) as an intra-coded block without any prediction from an anchor frame. Many MPEG coders simply apply a repeating pattern of I, P, and B frames. For example, a typical 15-frame GOP (group of pictures) pattern may consist of the frame sequence (IBBPBBPBBPBBPBB) repeated over the coded video stream.
A “dissolve” is a common technique used in video production to transition between two scenes. A dissolve is a gradual transition from a preceding scene to a subsequent scene that occurs over a number of consecutive frames. Each frame in a dissolve is the weighted average on a pixel-by-pixel basis of two images—one image from the preceding scene and the other from the subsequent scene, where pixel Dij in the i
th
row and j
th
column of a particular dissolve frame is given by Equation (1) as follows:
Dij
=(
Aij
)*(1
−k
)+(
Bij
)*(
k
) (1)
where Aij is the corresponding pixel in the corresponding image from the previous scene, Bij is the corresponding pixel in the corresponding image from the subsequent scene, and k is a weighting factor that starts at 0 at the first frame of the dissolve and increases to 1 at the last frame of the dissolve. The rest of the frames in a dissolve (where 0<k<1) are referred to as “mixed-video” frames, because they are formed as a mixture (i.e., the weighted average) of frames from two different scenes. Note that, in a dissolve, either the previous scene or the subsequent scene or both may correspond to still images. A fade to or from black (or white or any other uniform color) is just a special case of such still-image-based dissolves.
Dissolves are notoriously difficult to encode because the various prediction tools in MPEG algorithms do not work very well to predict the “mixed video” frames that make up a dissolve. For typical scene changes, no amount of motion compensation will yield a good prediction. For MPEG coders that apply a repeating pattern of I, P, and B frames over the coded video stream, depending on how long the frame pattern is relative to the length of the dissolve and the relative phasing of the frame pattern with respect to the dissolve, prediction errors over successive P frames during a dissolve can build up to a level where the corresponding decoded frames are very distorted.
SUMMARY OF THE INVENTION
The present invention is directed to a technique for improving the efficiency of coding dissolves in video streams. According to certain embodiments of the present invention, the coding of dissolves is constrained to ensure that, other than the first frame and/or the last frame, no other frame in a dissolve is coded as an anchor frame (e.g., an MPEG I or P frame). In these embodiment, the present invention constrains video coding such that all intermediate (e.g., mixed-video) frames in dissolves are coded as non-anchor frames (e.g., MPEG B frames). In other embodiments, one or more of the intermediate frames may be coded as anchor frames (e.g., MPEG I or P frames), with the rest of the mixed-video frames coded as B frames, where the one or more I or P frames are restricted to particular frame locations within the dissolve. For typical video dissolves, the present invention provides efficient coding in terms of both the bit rate of the corresponding compressed video bitstream as well as the quality of the corresponding decoded video images.
According to one embodiment, the present invention is a method for coding a video stream using a video compression algorithm that supports both intra-frame coding and inter-frame coding, comprising the steps of (a) selecting first and last frames corresponding to an n-frame dissolve between a previous scene and a subsequent scene in the video stream; and (b) constraining the coding of the n frames of the dissolve such that either (1) no intermediate frame in the dissolve falling between the first and last frames is coded as an anchor frame or (2) only one or more intermediate frames at one or more specific locations within the dissolve are coded as anchor frames, where the one or more specific locations are functions of the number n of frames in the dissolve and all other intermediate frames are coded as non-anchor frames.

REFERENCES:
patent: 5911008 (1999-06-01), Niikura et al.
patent: 6005621 (1999-12-01), Linzer et al.
patent: 6301428 (2001-10-01), Linzer

Affiliated with

Hurst, Jr. Robert Norman

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Boudreau Leo

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Burke, Esq. William J.

Attorney

[ 0.00 ] – not rated yet Voters 0 Comments 0

Dang Duy M.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Sarnoff Corporation

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Coding video dissolves using predictive encoders does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Coding video dissolves using predictive encoders, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Coding video dissolves using predictive encoders will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3309071

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure