Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1996-03-28
1999-03-30
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704233, 704500, G10L 302
Patent
active
058901094
ABSTRACT:
One or more dynamically updated measures are generated for an audio stream. Processing is performed using the measures to distinguish silent periods from non-silent periods in the audio stream and the audio stream is encoded, wherein the silent periods are encoded differently from the non-silent periods. The processing is re-initialized during the encoding of the audio stream, if certain conditions are met. In a preferred embodiment, the processing is re-initialized if either of the following two conditions is met: (1) one of the non-silent periods is longer than a duration threshold or (2) an energy measure for the silent periods of the audio stream exceeds an energy threshold level.
REFERENCES:
patent: 4412066 (1983-10-01), Ahmed
patent: 4449190 (1984-05-01), Flannagan et al.
patent: 4704730 (1987-11-01), Turner et al.
patent: 4893197 (1990-01-01), Howells et al.
patent: 5438643 (1995-08-01), Akagiri et al.
"Real-Time Implementation and Evaluation of an Adaptive Silence Deletion Algorithm for Speech Compression," by Chris Rose and Dr. Robert W. Donaldson, IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, May 9-10, 1991, pp. 461-468.
"The Voice Activity Detector for the Pan-European Digital Cellular Mobile Telephone Service," by D.K. Freeman, G. Cosier, C.B. Southcott, and I. Boyd, British Telecom Research Labs. Speech and Language Processing Division, Martlescham Health, Ipswich, England, 1989 IEEE, pp. 369-372.
"Voiced-Unvoiced-Silence Detection Using the Itakura LPC Distance Measure," by L.R. Rabiner and M.R. Sambur, 1977 IEEE International Conference on Acoustics, Speech & Signal Processing at the Sheraton-Hartford Hotel, Hartford, CT, May 9-11, 1977, pp. 323-326.
"Speech and Silence Discrimination Based on ADPCM Signals," by S.N. Koh and N.K. Lim, Journal of Electrical Engineering, Australia--IE Aust & IREE Aust. vol. 11, No. 4, Dec. 1991, pp. 245-248.
"Voiced-Unvoiced-Silence Classification of Speech Signals Based on Statistical Approaches," by B.A.R. Al-Hashemy and S.M.R. Taha, Applied Acoustics 25 1988 Elsevier Science Publishers Ltd. England, pp. 169-179.
"A Fast Neural Net Training Algorithm and Its Application to Voiced-Unvoiced-Silence Classification of Speech," by Thea Ghiselli-Crippa, Amro El-Jaroudi, 1991 IEEE, pp. 441-444.
"Fast Endpoint Detection Algorithm for Isolated Word Recognition in Office Environment," by Evangelos S. Dermatas, Nikos D. Fakotakis, and George K. Kokkinakis, 1991 IEEE, pp. 733-736.
"Silent and Voiced/Unvoiced/Mixed Excitation (Four-Way) Classification of Speech," by D.G. Childers, M. Hahn, and J.N. Larar, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 37, No. 11, Nov. 1989, pp. 1771-1774.
Comments on "An Improved Endpoint Detector for Isolated Word Recognition," by Ben Reaves, IEEE Transacitons on Signal Processing, vol. 39 No. 2, Feb. 1991, pp. 526-527.
"An Improved Endpoint Detector for Isolated Word Recognition," by Lori F. Lamel, Lawrence R. Rabiner, Aaron E. Rosenberg and Jay G. Wilpon, IEEE Transaction on Acoustics, Speech, and Signal Processing, vol ASSP-29, No. 4, Aug. 1981, pp. 777-785.
"Voiced/Unvoiced/Mixed Excitation Classification of Speech," by Leah J. Siegel and Alan C. Bessey, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-30, No. 3, Jun. 1982, pp. 451-460.
"Voice Activity Detection Using a Periodicity Measure," by R. Tucker, IEE Proceedings-1, vol. 139, No. 4, Aug. 1992, pp. 377-380.
"Application of an LPC Distance Measure to the Voiced-Unvoiced-Silence Detection Problem," by Lawrence R. Rabiner and Marvin R. Sambur, IEEE Transaction on Acoustics, Speech, and Signal Processing, vol. ASSP-25, No. 4, Aug. 1977, pp. 338-343.
"A Pattern Recognition Approach to Voiced-Unvoiced-Silence Classification with Applications to Speech Recognition," by Bishnu S. Atal and Lawrence R. Rabiner, IEEE Transacitons on Acoustics, speech, and Signal Processing, vol. ASSP-24, No. 3, Jun. 1976, pp. 201-212.
"Multimedia Conferencing in the Etherphone Enviornment," by Harrick M. Vin, Polle T. Zellweger, Daniel C. Swinehart, P. Venkat Rangan, Xerox Palo Alto Research Center, Oct. 1991 IEEE, pp. 69-79.
"Linear Prediction: A Tuturial Review," by John Makhoul, Proceedings of the IEEE, vol. 63, No. 4, Apr. 1975, pp. 561-580.
Gan, C. and Donaldson, "Adaptive Silence Deletion for Speech Storage and Voice Mail Applications", IEEE Tranactions on Acoustics, Speech, and Signal Processing Jun. 1988, 36(6), 924-927.
Southcott, C.B. et al, "Voice Control of the Pan-European Digital Mobile Radio System", Communications Technology for the 1990's and Beyond, Institue of Electrical and Electronics Engineers, Nov. 27-30, 1989, vol. 2 of 3, 1070-1074.
Keith Michael
Kidder Jeffrey
Walker Mark R.
Dorvil Richemond
Intel Corporation
Kinsella N. Stephan
Murray William H.
LandOfFree
Re-initializing adaptive parameters for encoding audio signals does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Re-initializing adaptive parameters for encoding audio signals, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Re-initializing adaptive parameters for encoding audio signals will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1225227