Coded data generation or conversion – Analog to or from digital conversion – Analog to digital conversion
Reexamination Certificate
2000-11-02
2002-02-26
JeanPierre, Peguy (Department: 2819)
C348S014160
active
06351233
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a videoconference terminal and an image/voice regeneration method to be used for this videoconference terminal, and relates, more particularly, to a method of regenerating an image and voice at a videoconference terminal.
2. Description of Related Art
In the transmission of image and voice signals by using a videoconference terminal, the received timings of the image signal and voice signal after the compression and expansion of these signals become different from the timings when the original signals were transmitted. This is because the time required for compressing and expanding the image signal is different from the time required for compressing and expanding the voice signal.
In general, an image signal requires a larger amount of processing information than a voice signal. Therefore, at the receiving end, the processing of the image signal is completed later than that of the voice signal. This results in an unnatural regeneration of the signals, in which the image appears after the voice.
In order to solve this problem, there has been a conventional videoconference terminal that has a function of selecting from among preset fixed values a delay volume for delaying the timing of regenerating the voice at the receiving end. When the videoconference terminal having this function is used, it is possible to match to some extent the timing of regenerating the image with the timing of regenerating the voice.
As another conventional technique for solving the above problem, there has been the image/voice synchronization system of MPEG-2 (Moving Picture Experts Group phase 2), which is one of the motion picture compression systems.
According to the MPEG-1 (Moving Picture Experts Group phase 1) system and the MPEG-2 system (hereinafter collectively referred to as the MPEG system), each of an image packet and a voice packet has its own time stamp called a PTS (Presentation Time Stamp).
The PTS is stored in the header (packet header) of the image packet and the voice packet respectively at the time of transmitting the signals to the receiving end.
FIG. 1 shows a state in which the image and the voice are multiplexed into packets by MPEG-2. A packet header is embedded in each of the image packet and the voice packet, and a value of the PTS is stored in the packet header.
Meanwhile, at the receiving end, there is a counter (STC: System Time Clock) that is accurately synchronized with the transmitting end. A decoder regenerates the image and the voice when the value of the PTS stored in each packet header of the received image and voice becomes equal to the value of the STC of the receiver.
In other words, when the value of the PTS (a regeneration time) is stored in advance at the transmitting end such that the image and the voice are regenerated at the same time at the receiving end, the receiver can obtain an output image and an output voice that are synchronized with each other. The MPEG system is described in detail in “The Latest MPEG Textbook” (in Japanese), Ascii Publishing Co., Ltd., 1995.
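The PTS/STC comparison described above can be sketched as follows. This is only a minimal illustration in Python, not the MPEG decoder model: the names Packet and presentation_schedule, and the small integer tick values, are hypothetical and do not come from the MPEG specification or from this patent.

```python
from dataclasses import dataclass


@dataclass
class Packet:
    kind: str  # "image" or "voice"
    pts: int   # presentation time stamp, in hypothetical clock ticks


def presentation_schedule(packets, stc_start, stc_end):
    """Step a receiver-side STC and present each packet when PTS == STC."""
    pending = {}
    for p in packets:
        pending.setdefault(p.pts, []).append(p.kind)
    schedule = []
    for stc in range(stc_start, stc_end + 1):
        # Every packet whose PTS equals the current STC is presented now.
        for kind in sorted(pending.get(stc, [])):
            schedule.append((stc, kind))
    return schedule


# Packets may arrive in any order; presentation depends only on the PTS.
packets = [Packet("voice", 3), Packet("image", 3),
           Packet("voice", 6), Packet("image", 6)]
schedule = presentation_schedule(packets, 0, 8)
```

Because an image packet and a voice packet that carry the same PTS are presented at the same STC value, the two outputs stay synchronized regardless of how long each took to decode.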
According to the above-described conventional videoconference terminal, however, there has been the following problem. When the image and the voice are transmitted, the time required for the compression processing and the expansion processing is not constant and is different depending on the contents of the input signal. Therefore, according to the conventional method of fixing the delay volume, it is not always possible to make the timing of regenerating the image and the timing of regenerating the voice coincide with each other.
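The limitation of a fixed delay can be illustrated with a short calculation. The sketch below is in Python and uses made-up processing times; the function name residual_skew and all numeric values are assumptions chosen only for illustration.

```python
def residual_skew(image_delays_ms, voice_delays_ms, fixed_voice_delay_ms):
    """Image-minus-voice presentation skew when the voice gets one fixed delay."""
    return [img - (voc + fixed_voice_delay_ms)
            for img, voc in zip(image_delays_ms, voice_delays_ms)]


# Hypothetical per-frame compression plus expansion times, which vary
# with the contents of the input signal.
image_delays_ms = [120, 180, 95, 210]
voice_delays_ms = [20, 20, 25, 20]

# A preset delay of 100 ms cancels the skew only for the first frame.
skews = residual_skew(image_delays_ms, voice_delays_ms, 100)
```

With these values the skew is zero for only one frame and ranges from -30 ms to 90 ms for the others, which is why a preset delay can match the timings only to some extent.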
Further, according to the image/voice synchronization system that is employed in the MPEG system, it is always possible to make the timing of regenerating the image and the timing of regenerating the voice coincide with each other. However, this system has the following problems.
First, according to the above image/voice synchronization system, a large amount of information is required for the synchronized regeneration of signals. The PTS value is given a large range (24 hours or more) at the receiving end. Therefore, the data width of the PTS is as large as 44 bits. As a result, the circuit scale becomes large. Further, as the PTS is stored in the header, the length of the header becomes large.
Second, the above-described image/voice synchronization system is exclusive to MPEG. As the PTS is stored in a packet header that is specific to the MPEG system, only a system that uses the MPEG system can utilize the PTS. Therefore, there is no compatibility with other motion picture encoding systems.
SUMMARY OF THE INVENTION
The present invention has been made to solve the above-described conventional problems. It is, therefore, an object of the present invention to provide a videoconference terminal and an image and voice regeneration system to be used therefor, which are capable of easily achieving a videoconference with a sense of realism by regenerating the image and the voice at the same timing as that of the transmitting end, without the need for increasing the header information and regardless of the compression/expansion system.
In order to meet the above object, according to the present invention, there is provided a videoconference terminal that regenerates an image and voice by always accurately matching the regeneration timing with that at the transmitting end. Therefore, at the receiving end, it is possible to regenerate the image and voice at the same timing as that at the transmitting end.
More specifically, the videoconference terminal of the present invention comprises a transmitter and a receiver, wherein the transmitter comprises: an analog-to-digital converter for converting input analog image and analog voice signals to input digital image and voice signals, respectively; a marker for simultaneously and periodically embedding a marking signal in the input digital image signal and the input digital voice signal corresponding to the input digital image signal to produce digital image and voice signals; and a data compressor for compressing the digital image signal and the digital voice signal to produce a compressed image signal and a compressed voice signal, which are transmitted to another end of the videoconference.
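The simultaneous, periodic marking performed at the transmitter might be sketched as follows in Python. The form of the marking signal is not described in this part of the text, so the boolean tag used here, the function name embed_markers, and the period parameter are all hypothetical.

```python
def embed_markers(image_frames, voice_blocks, period):
    """Tag the image and voice units captured at the same instant.

    Every `period`-th unit of both streams receives a marker, so a marked
    image unit always corresponds to a marked voice unit.
    """
    marked_image, marked_voice = [], []
    for i, (frame, block) in enumerate(zip(image_frames, voice_blocks)):
        mark = (i % period == 0)  # assumed marker representation: a flag
        marked_image.append((frame, mark))
        marked_voice.append((block, mark))
    return marked_image, marked_voice


img, voc = embed_markers(["f0", "f1", "f2", "f3", "f4"],
                         ["v0", "v1", "v2", "v3", "v4"],
                         period=2)
```

The point of the sketch is only that the marker is placed in both signals at the same moment before compression, so that any later difference in marker arrival reflects the difference in processing time.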
The receiver comprises: a data expander for expanding received image signal and received voice signal to produce received digital image signal and received digital voice signal; a time difference detector for detecting an arrival time difference between the received digital image signal and the received digital voice signal based on marking signals detected from the received digital image signal and the received digital voice signal, respectively; a digital-to-analog converter for converting the received digital image signal and the received digital voice signal to a received analog image signal and a received analog voice signal; and an adjuster for adjusting timings of the received analog image signal and the received analog voice signal depending on the arrival time difference.
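The time difference detector and the adjuster at the receiver can be sketched in the same spirit. This Python fragment assumes that marker arrival times are already available in milliseconds and that the adjuster holds back whichever signal arrives first; the function names and sample values are hypothetical.

```python
def arrival_time_difference(image_marker_times_ms, voice_marker_times_ms):
    """Image-minus-voice arrival time for each pair of corresponding markers."""
    return [i - v for i, v in zip(image_marker_times_ms, voice_marker_times_ms)]


def adjust_delays(diff_ms):
    """Return (image_delay_ms, voice_delay_ms) that cancels the difference."""
    if diff_ms >= 0:
        return 0, diff_ms   # image is late: delay the voice
    return -diff_ms, 0      # voice is late: delay the image


# Hypothetical arrival times of three successive markers in each signal.
diffs = arrival_time_difference([150, 260, 340], [50, 150, 250])
delays = [adjust_delays(d) for d in diffs]
```

Because the difference is measured again at every marker, the applied delay follows the actual processing time instead of relying on a preset fixed value.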
The analog image signal input from a camera or the like is quantized by an image A/D converter. The quantized signal is then passed through a marking signal adding circuit, and is compressed according to a transmission speed in a transmission path by an image compressing circuit. Thereafter, the signal is multiplexed with the voice signal by a multiplexing circuit, and the multiplexed signal is sent to the transmission path.
On the other hand, the analog voice signal input from a microphone or the like is quantized by a voice A/D converter. The quantized signal is then passed through a marking signal adding circuit, and is compressed by a voice compressing circuit. Thereafter, the signal is multiplexed with the image signal by a multiplexing circuit, and the multiplexed signal is sent to the transmission path. The image compressing circuit and the voice compressing circuit compress the quantized image and voice signals respectively by us