Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
1999-09-28
2002-05-21
Dorvil, Richemond (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S219000, C704S222000
Reexamination Certificate
active
06393392
ABSTRACT:
TECHNICAL FIELD
The present invention relates to encoding and decoding of multi-channel signals, such as stereo audio signals.
BACKGROUND OF THE INVENTION
Existing speech coding methods are generally based on single-channel speech signals. An example is the speech coding used in a connection between a regular telephone and a cellular telephone. Speech coding is used on the radio link to reduce bandwidth usage on the frequency limited air-interface. Well known examples of speech coding are PCM (Pulse Code Modulation), ADPCM (Adaptive Differential Pulse Code Modulation), sub-band coding, transform coding, LPC (Linear Predictive Coding) vocoding, and hybrid coding, such as CELP (Code-Excited Linear Predictive) coding. See A. Gersho, “Advances in Speech and Audio Compression”, Proc. of the IEEE, Vol. 82, No. 6, pp. 900-918, June 1994; A. S. Spanias, “Speech Coding: A Tutorial Review”, Proc. of the IEEE, Vol. 82, No. 10, pp. 1541-1582, October 1994.
In an environment where the audio/voice communication uses more than one input signal, for example a computer workstation with stereo loudspeakers and two microphones (stereo microphones), two audio/voice channels are required to transmit the stereo signals. Another example of a multi-channel environment would be a conference room with two, three or four channel input/output. These types of applications are expected to be used on the internet and in third generation cellular systems.
From the area of music coding it is known that correlated multi-channels are more efficiently coded if a joint coding technique is used, an overview is given in P. Noll, “Wideband Speech and Audio Coding”, IEEE Commun. Mag. Vol. 31, No. 11, pp. 34-44, 1993. In B. Grill et al., “Improved MPEG-2 Audio Multi-Channel Encoding”, 96
th
Audio Engineering Society Convention, pp. 1-9, 1994, W. R. Th. Ten Kate et al., “Matrixing of Bit Rate Reduced Audio Signals”, Proc. ICASSP, Vol. 2, pp. 205-208, 1992, and M. Bosi et al., “ISO/IEC MPEG-2 Advanced Audio Coding”, 101
st
Audio Engineering Society Convention, 1996 a technique called matrixing (or sum and difference coding) is used. Prediction is also used to reduce inter-channel redundancy, see B. Grill et al., “Improved MPEG-2 Audio Multi-Channel Encoding”, 96
th
Audio Engineering Society Convention, pp. 1-9, 1994, W. R. Th. Ten Kate et al., “Matrixing of Bit Rate Reduced Audio Signals”, Proc. ICASSP, Vol. 2, pp. 205-208, 1992, M. Bosi et al., “ISO/IEC MPEG-2 Advanced audio Coding”, 101
st
Audio Engineering Society Convention, 1996, and EP 0 797 324 A2, Lucent Technologies, Inc., “Enhanced stereo coding method using temporal envelope shaping”, where the prediction is used for intensity coding or spectral prediction. Another technique known from WO 90/16136, British Teleom., “Polyphonic Coding” uses time aligned sum and difference signals and prediction between channels. Furthermore, prediction has been used to remove redundancy between channels in waveform coding methods. See WO 97/04621, Robert Bosch Gmbh, “Process for reducing redundancy during the coding of multi-channel signals and device for decoding redundancy reduced multi-channel signals”. The problem of stereo channels is also encountered in the echo cancellation area, an overview is given in M Mohan Sondhi et al., “Stereophonic Acoustic Echo Cancellation—An Overview of the Fundamental Problem”, IEEE Signal Processing Letters, Vol. 2, No. 8, August 1995.
From the described state of the art it is known that a joint coding technique will exploit the inter-channel redundancy. This feature has been used for audio (music) coding at higher bit rates and in connection with waveform coding, such as sub-band coding in MPEG. To reduce the bit rate further, below M (the number of channels) times 16-20 kb/s, and to do this for wideband (approximately 7 kHz) or narrowband (3-4 kHz) signals requires a more efficient coding technique.
SUMMARY OF THE INVENTION
An object of the present invention is to reduce the coding bit rate in multi-channel analysis-by-synthesis signal coding from M (the number of channels) times the coding bit rate of a single (mono) channel bit rate to a lower bit rate.
This object is solved in accordance with the appended claims.
Briefly, the present invention involves generalizing different elements in a single-channel linear predictive analysis-by-synthesis (LPAS) encoder with their multi-channel counterparts. The most fundamental modifications are the analysis and synthesis filters, which are replaced by filter blocks having matrix-valued transfer functions. These matrix-valued transfer functions will have non-diagonal matrix elements that reduce inter-channel redundancy. Another fundamental feature is that the search for best coding parameters is performed closed-loop (analysis-by-synthesis).
REFERENCES:
patent: 4636799 (1987-01-01), Kubick
patent: 4706094 (1987-11-01), Kubick
patent: 5105372 (1992-04-01), Provost et al.
patent: 5235647 (1993-08-01), Van de Kerkhof
patent: 5924062 (1999-07-01), Maung
patent: 6104321 (2000-08-01), Akagiri
patent: 6307962 (2001-10-01), Parker et al.
patent: 0 797 324 (1997-09-01), None
patent: WO 90/16136 (1990-12-01), None
patent: WO 93/10571 (1993-05-01), None
patent: WO 97/04621 (1997-02-01), None
Gersho, A., “Advances in Speech and Audio Compression,” Proc. of the IEEE, vol. 82, No. 6, pp. 900-916, Jun. 1994.
Spanias, A.S., “Speech Coding: A Tutorial Review,” Proc. of the IEEE, vol. 82, Vo. 10, pp. 1541-1582, Oct. 1994.
Noll, P., “Wideband Speech and Audio Coding,” IEEE Commun. Mag. vol. 31, No. 11, pp. 34-44, 1993.
Grill, B., et al., “Improved MPEG-2 Audio Multi-Channel Encoding,” 96thAudio Engineering Society Convention, 1996.
Th. Ten Kate, W.R., et al., “Matrixing of Bit Rate Reduced Audio Signals,” Proc. ICASSP, vol. 2, pp. 205-208, 1992.
Bosi, M., et al., “ISO/IEC MPEG-2 Advanced Audio Coding,” 101stAudio Engineering Society Convention, 1996.
Sondhi, M. Mohan, et al., “Sterophonic Acoustic Echo Cancellation—An Overview of the Fundamental Problem,” IEEE Signal Processing Letters, vol. 2, No. 8, Aug. 1995.
Kroon, P., et al., “A Class of Analysis-by-Synthesis Predictive Coders for High Quality Speech Coding at Rates Between 4.8 and 16 kbits/s,” IEEE Journ. Sel. Areas Com., vol. SAC-6, No. 2, pp. 353-363, Feb. 1988.
Laflamme, C., et al., “16 Kbps Wideband Speech Coding Technique Based on Algebraic CELP,” Proc. ICASSP, pp. 13-16, 1991.
Krembel, L., EPO Standard Search Report, File No. RS 101759, Re: SEA 9803321, pp. 1-3, Mar. 30, 1999.
Stoll, G., et al., “MPEG-2 Audio: TheNew MPEG-1 Compatible Standard for Encoding of Digital Surround Sound for DAB, DVB and Computer Multimedia,” ITG-Fachberichte, No. 133, pp. 153-160, Jan. 1, 1995, XP 000571182.
Benyassine, A., et al., “Multiband CELP Coding of Speech,” Proceedings of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, Nov. 5-7, 1990, vol. 2, No. Conf. 24, pp. 644-648, Nov. 5, 1990. XP000280093.
Fuchs, H., “Improving Joint Stero Audio Coding by Adaptive Inter-Channel Prediction,” IEEE Workshop on Applications of Signal Processing to Audio Acoustics, pp. 39-42, Oct. 17, 1993, XP000570718.
Ikeda, K. et al., “Audio Transfer System on PHS Using Error-Protected Stereo Twin VQ,” 1998 International Conference on Consumer Electronics, Los Angeles, CA, USA, Jun. 2-4, 1998, vol. 44, No. 3, pp. 1032-1038, XP002097383, ISSN 0098-3063, IEEE Transactions on Consumer Electronics, IEEE, USA, Aug. 1998.
Bengtsson, R., International Search Report, International App. No. PCT/SE99/02067, Mar. 24, 2000, pp. 1-3.
Dorvil Richemond
Jenkens & Gilchrist P.C.
Nolan Daniel
Telefonaktiebolaget LM Ericsson (publ)
LandOfFree
Multi-channel signal encoding and decoding does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Multi-channel signal encoding and decoding, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multi-channel signal encoding and decoding will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2891586