Pulse or digital communications – Bandwidth reduction or expansion – Television or motion video signal
Reexamination Certificate
1998-02-05
2001-06-26
Britton, Howard (Department: 2713)
active
06252905
ABSTRACT:
TECHNICAL FIELD
This invention relates in general to an apparatus and method for encoding digital visual images including spatial (intra-picture) and temporal (inter-picture) compression, that is, reducing both redundancy within a picture and redundancy between pictures. Redundancy within pictures is reduced, or even eliminated, by the use of the discrete cosine transform, quantization, and variable length encoding. Redundancy between pictures is reduced, or even eliminated, by the use of motion vectors. One aspect of intra-picture compression comprises bit regulation to achieve an optimized bit rate while still maintaining picture quality and detail. This invention is specifically directed to real-time encoding with compressed picture quality evaluation for optimal, dynamic encoding of video data within a variable bit rate bandwidth capability.
BACKGROUND OF THE INVENTION
Within the past decade, the advent of world-wide electronic communications systems has enhanced the way in which people can send and receive information. In particular, the capabilities of real-time video and audio systems have greatly improved in recent years. In order to provide services such as video-on-demand and video conferencing to subscribers, an enormous amount of network bandwidth is required. In fact, network bandwidth is often the main inhibitor in the effectiveness of such systems.
In order to overcome the constraints imposed by networks, compression systems have emerged. These systems reduce the amount of video and audio data which must be transmitted by removing redundancy in the picture sequence. At the receiving end, the picture sequence is uncompressed and may be displayed in real-time.
One example of an emerging video compression standard is the Moving Picture Experts Group (“MPEG”) standard. Within the MPEG standard, video compression is defined both within a given picture and between pictures. Video compression within a picture is accomplished by conversion of the digital image from the spatial domain to the frequency domain by a discrete cosine transform, followed by quantization and variable length coding. Video compression between pictures is accomplished via a process referred to as motion estimation and compensation, in which a motion vector plus difference data is used to describe the translation of a set of picture elements (pels) from one picture to another.
The ISO MPEG-2 standard specifies only the syntax of the bitstream and the semantics of the decoding process. The choice of coding parameters and tradeoffs in performance versus complexity are left to the encoder developers.
One aspect of the encoding process is compressing a digital video image into as small a bitstream as possible while still maintaining video detail and quality. The MPEG standard places limitations on the size of the bitstream, and requires that the encoder be able to perform the encoding process. Thus, simply optimizing the bit rate to maintain desired picture quality and detail can be difficult.
For example, a bit rate is defined in bits per second. Based on the frame rate and type of picture being encoded, a number of bits per picture is assigned. At 6,000,000 bits per second (6 Mbps), and pictures at 30 picture frames per second, each picture would be allocated 200,000 bits assuming that the bits are allocated uniformly. With a 720×480 picture having 1350 macroblocks, this translates into 148 bits allocated per macroblock. Thus, in the case of scene changes and action videos, the bit rate can be quickly consumed with drastic changes between macroblocks and/or between frames. Picture quality and detail can suffer as a result.
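The arithmetic in the preceding paragraph can be checked with a short sketch. It assumes 16×16-pel macroblocks (the usual MPEG macroblock size, which yields the 1350 figure quoted above) and a uniform allocation of bits across pictures and macroblocks; the function name and parameters are illustrative only.

```python
def uniform_bit_budget(bit_rate_bps, frames_per_second, width, height, mb_size=16):
    """Return (bits per picture, macroblocks per picture, bits per macroblock)
    under the simplifying assumption that bits are allocated uniformly."""
    bits_per_picture = bit_rate_bps / frames_per_second
    # Assumption: the picture is tiled by mb_size x mb_size macroblocks.
    macroblocks = (width // mb_size) * (height // mb_size)
    return bits_per_picture, macroblocks, bits_per_picture / macroblocks


bits_per_picture, macroblocks, bits_per_mb = uniform_bit_budget(6_000_000, 30, 720, 480)
print(bits_per_picture, macroblocks, int(bits_per_mb))
```

For the 6 Mbps, 30 frames per second, 720×480 case this gives 200,000 bits per picture, 1350 macroblocks, and roughly 148 bits per macroblock, matching the figures in the text.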
Video compression requires advanced techniques to achieve the highest amount of compression, while still obtaining a desired picture quality. Variable bit rate (VBR) encoding is an option in video compression that allows for each compressed picture to have a different number of bits based on complexity of intra and inter-picture characteristics. For example, a scene with simple picture content (such as a color test pattern) will require significantly fewer bits to encode than a scene with complicated picture content (such as a crowded city street) in order to obtain desired picture quality. Because of the amount of information that is needed to characterize the video and the complexity of the algorithms needed to interpret the information to effectively enhance the encoding process, VBR encoding is conventionally accomplished in a non-real-time, two or more pass encoding process. In a first pass, statistics are gathered and analyzed, and in a second pass, the results of the analysis are used to control the encoding process. Although this produces high quality encoding, it does not allow for real-time operation.
This invention seeks to monitor or predict picture quality of a partially encoded video sequence within an encoder in order to optimize picture quality in real time within the video encoding system.
DISCLOSURE OF THE INVENTION
Briefly summarized, the invention comprises in a first aspect a method for encoding a sequence of video data. The method includes compressing the sequence of video data to produce a compressed signal by: compressing the sequence of video data n times, wherein n≧1, each compressing employing at least one different compression parameter, thereby producing n different compressed video signals; evaluating picture quality of the n compressed video signals; and automatically selecting one of the n compressed video signals for output as the compressed signal, the automatically selecting of one compressed video signal for output being based on the evaluating of picture quality of the n compressed video signals.
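The first aspect (compress n times with different parameters, evaluate quality, select one result) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the patent's implementation: the "compression parameter" is taken to be a uniform quantizer scale, picture quality is approximated by mean squared error against the unquantized coefficients, and the selection rule (the coarsest scale whose error stays under a threshold) is hypothetical.

```python
def quantize(coeffs, scale):
    # Uniform scalar quantization: map each coefficient to an integer level.
    return [round(c / scale) for c in coeffs]


def dequantize(levels, scale):
    # Reconstruct approximate coefficients from the quantized levels.
    return [level * scale for level in levels]


def mean_squared_error(original, reconstructed):
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)


def select_compressed(coeffs, scales, max_mse):
    """Compress once per scale, evaluate each result, and select one.

    Hypothetical rule: prefer the coarsest scale whose error is within
    max_mse; if none qualifies, fall back to the lowest-error candidate.
    Returns (scale, levels, mse).
    """
    candidates = []
    for scale in scales:
        levels = quantize(coeffs, scale)
        error = mean_squared_error(coeffs, dequantize(levels, scale))
        candidates.append((scale, levels, error))
    acceptable = [c for c in candidates if c[2] <= max_mse]
    if acceptable:
        return max(acceptable, key=lambda c: c[0])
    return min(candidates, key=lambda c: c[2])


# Illustrative coefficient values, not taken from the patent.
sample = [100, -37, 12, 5, -3, 1, 0, 0]
chosen_scale, chosen_levels, chosen_mse = select_compressed(sample, [2, 8, 16], max_mse=10)
print(chosen_scale, chosen_mse)
```

A coarser scale generally produces smaller levels and therefore fewer bits after variable length coding, which is why this sketch prefers the coarsest acceptable candidate; a real encoder would weigh quality against the available bit budget in whatever way its rate control dictates.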
In another aspect, the invention comprises a method for encoding a video signal which includes encoding the video signal by: compressing the video signal to produce a compressed video signal; and automatically evaluating in real time data loss resulting from the compressing of the video signal and producing therefrom loss statistics representative of picture quality of the compressed video signal, wherein the automatic evaluating occurs during encoding of the video signal.
In still another aspect, the invention comprises a digital video encoder having a quantization subsystem for compressing a sequence of video data to produce a compressed signal. The quantization subsystem includes means for compressing the sequence of video data n times, wherein n≧1, and wherein each compressing of the video data employs a different compression parameter to thereby produce n different compressed video signals. The quantization subsystem further includes means for evaluating picture quality of the n compressed video signals in real time, and means for automatically selecting one of the n compressed video signals for output as the compressed signal, selection of the compressed signal being based upon the evaluation of picture quality thereof.
In another aspect, the invention comprises a digital video encoder which includes a quantizer and logic for automatically evaluating in real time data loss resulting from quantization of the video data. The quantizer produces a quantized video signal from a received video signal, while the logic for automatically evaluating evaluates data loss resulting from quantization of the received video signal. This evaluation of data loss is then used to decide whether to output the quantized video signal for further encoding or whether to requantize the received video signal using a different quantization scale.
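The quantize, evaluate, and requantize decision described in this aspect can be sketched as a simple loop. This is an assumption-laden illustration rather than the patented logic: data loss is measured as mean absolute error between the original and reconstructed coefficients, and the "different quantization scale" is obtained by halving the current scale. The function name, the loss measure, and the halving rule are all hypothetical.

```python
def quantize_with_loss_check(coeffs, initial_scale, max_loss, min_scale=1):
    """Quantize, measure data loss, and requantize with a finer scale if needed.

    Returns (scale, levels, loss) for the first scale whose loss is
    acceptable, or for min_scale if no coarser scale qualifies.
    """
    scale = initial_scale
    while True:
        levels = [round(c / scale) for c in coeffs]
        # Hypothetical loss statistic: mean absolute reconstruction error.
        loss = sum(abs(c - level * scale) for c, level in zip(coeffs, levels)) / len(coeffs)
        if loss <= max_loss or scale <= min_scale:
            return scale, levels, loss
        # Assumed requantization rule: halve the scale and try again.
        scale = max(min_scale, scale // 2)


# Illustrative coefficient values, not taken from the patent.
block = [100, -37, 12, 5, -3, 1, 0, 0]
final_scale, final_levels, final_loss = quantize_with_loss_check(block, initial_scale=16, max_loss=2.0)
print(final_scale, final_loss)
```

Because the loss is computed immediately after quantization, the decision to accept or requantize can be made before the data moves on to variable length coding, which is the property that makes this kind of check compatible with single-pass, real-time operation.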
In a further aspect, the invention comprises a computer program product including a computer usable medium having computer readable program code means therein for use in encoding the sequence of video data. The computer readable program code means in the computer program product includes computer readable program code means for causing a computer to effect compressing the sequence of video data to produce a compressed signal, the compressing including: (i) compressing the sequence of video data n times, wherein n≧1, and wherein each compres
Pokrinchak Stephen P.
Stein Charles J.
Britton Howard
Heslin & Rothenberg, P.C.
International Business Machines Corporation
Steinberg, Esq. William
Vo Tung