Method for temporally smooth, minimal memory MPEG-2 trick...

Motion video signal processing for recording or reproducing – Local trick play processing – With randomly accessible medium

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C386S349000

Reexamination Certificate

active

06658199

ABSTRACT:

BACKGROUND OF THE INVENTION
The present invention relates to a video recorder and, more particularly, to a method for constructing trick play mode video displays from an MPEG-2 digital video transport stream using a digital video recorder.
A conventional analog video recorder records the video signal in its transmitted analog format (such as, the NTSC signal format). At play time, the recorded signal is transmitted over a cable to a display device which is capable of displaying signals of the transmitted format. In addition to the standard play mode (forward direction, standard speed), analog video recorders are capable of displaying video in several “trick play” modes. Trick play modes include fast forward play, slow forward play, fast reverse play, slow reverse play, and pause. Consumers are likely to expect that a video recorder used in conjunction with digital television (DTV) will have, at least, the same trick play mode capabilities as analog video recorders. However, the MPEG-2 data compression techniques used with DTV make creation of trick play modes from the DTV transport data stream problematic, particularly over a simple, bit rate limited, communication channel between a video recorder and a display device.
Motion video comprises a sequence of images or frames. The images are originally recorded as analog signals. For digital television, the analog signals for a video element of a program are input to an encoder that converts the signals to digital data, compresses the digital data, and combines the digital video data with data related to the audio and data elements of the program to output a single transport data stream. The transport data stream is transmitted to a receiver where a decoder reverses the process to produce a close approximation of the original analog signal for presentation to the viewer. The quantity of data resulting from converting analog signals to digital signals is so great that digital motion video would be impractical if the data could not be compressed. However, there is considerable data redundancy within an image and between the images of a video sequence. MPEG-2 provides a toolkit of techniques that can be used to reduce this redundancy and, thereby, reduce the quantity of data required to digitally describe the images of the video sequence.
The DTV system is based on the MPEG-2 Main profile which provides for three types of video frames (I-, P-, and B- frames). Typically, the succession of frames comprising a video sequence is divided for convenience into groups of frames or groups of pictures (GOP). Each GOP is anchored by an entirely self-coded (intracoded) frame or I-frame. Intracoding data compression techniques are used to reduce data redundancy within a single image, but all of the data necessary to decode and reconstruct an I-frame is transmitted. Since I-frames require a relatively large quantity of data, the number of I-frames is minimized. However, I-frames are periodically required in the data stream to enable recovery of the video stream after channel switching or error outages, and the MPEG-2 standard requires an I-frame at least every 132 frames. P-frames and B-frames are produced with interframe data compression as well intraframe data compression. Interframe data compression uses motion estimation to predict the content of a frame from the content of one or more other reference frames. P-frames are frames which are forward predicted from a previous reference frame (either an I- or P- frame). Data for a P-frame includes motion estimation vectors describing movement of blocks of pixels between the current frame and the frame on which prediction is based and the differential data which must be added to the blocks of the earlier frame to construct the image of the later P-frame. A P-frame requires roughly half the data of an I-frame. On the other hand, a B-frame is bidirectionally predicted from earlier and later reference frames. B-frame data comprises motion estimation vectors describing where data should be taken from the earlier and later frames and typically requires about one-fourth the data of an I-frame. B-frames are used to increase the compression efficiency and perceived picture quality but cannot be used to predict future frames.
MPEG-2 provides flexibility as to use, size, and make up of the GOP, but a 12-frame GOP is typical for a 25 frames per second system frame rate and a 15-frame GOP is typical for a 30 frames per second system. An exemplary 15 frame GOP might comprise the following frames transmitted in the following order:
. . . I
0
, B
0
, B
1
, P
0
, B
2
, B
3
, P
1
, B
4
, B
5
, P
2
, B
6
, B
7
, P
3
, B
8
, B
9
. . .
At the decoder, the transport stream is decoded, decompressed and reordered to reconstruct the images of the original video image sequence. Since the data from earlier frames must be available to predict and reconstruct later frames, the order of transmission of frames will be different from the order in which the frames will be displayed. This requires the encoder and decoder to reorder the frames, even for standard play mode. In standard forward play mode the frames of this exemplary GOP would be displayed in the following order:
. . . B
0
, B
1
, I
0
, B
2
, B
3
, P
0
, B
4
, B
5
, P
1
, B
6
, B
7
, P
2
, B
8
, B
9
, P
3
. . .
The I-frame (I
0
) is the third frame displayed but must be transmitted first so that P
0
, B
0
, and B
1
can be decoded. Likewise, P
0
is transmitted before B
2
and B
3
because P
0
and I
0
are necessary to decode the B-frames (B
2
and B
3
) The exemplary GOP is an “open” GOP having a prediction link to a prior GOP. The initial B-frames (B
0
and B
1
) are decoded from the data of frame I
0
and the last P-frame (P
3
) of the previous GOP. MPEG also provides for a closed GOP with no prediction links to frames outside of the GOP. As a result of bidirectional prediction and the temporally forward nature of MPEG-2 compressed digital motion video, the trick play modes that can be created by selecting frames from the transport stream are very limited and reversing the order in which frames are transported is not useful for creating reverse play display modes.
One method used to provide trick play with recorders of MPEG-2 digital video is to first decode and store an entire GOP in the forward direction. The trick play system can then select an appropriate number of frames and a display order to create the trick play video display from the decompressed and decoded frames. However, the decoder must have large and costly frame buffers to store the decompressed versions of all the frames in the GOP. Since this is not required for normal forward play, the cost of the decoder would be substantially increased which would increase the cost of the receiver or video recorder. In addition, the transmission channel between the recorder and the display could easily be overwhelmed by the quantity of decompressed data required for a trick play display, especially in a fast play mode. Further, this technique requires that the entire GOP be decoded, even during fast play modes. To do this, the decoder must be capable of decoding multiple frames in a single normal frame decoding period. Most decoders do not have this capability.
A second method of providing trick play modes is to decode and display only the I-frames of each GOP. An I-frame includes all of the necessary data to decode the frame and, therefore, the I-frames of a video sequence can be decoded and displayed in any order. Since I-frames are typically only one frame in 12 to 15 frames, each I-frame would be displayed for as many frame periods as are required to create the desired frame rate. However, video produced by displaying only the I-frames has a jerky quality because of the large gaps in the content produced by discarding the intervening P- and B-frames.
In a third method of creating a trick play video display sequence, frames are decoded but are not displayed until a frame that has been selected for the trick play video display is reached. The desired frame is then decoded and displayed. Since the method do

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for temporally smooth, minimal memory MPEG-2 trick... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for temporally smooth, minimal memory MPEG-2 trick..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for temporally smooth, minimal memory MPEG-2 trick... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3130127

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.