Patent
1992-12-31
1997-01-21
MacDonald, Allen R.
395 259, 395 262, 395 264, G10L 506, G10L 900
Patent
active
055966802
ABSTRACT:
A method and apparatus for detecting speech activity in an input signal. The present invention includes performing begin point detection using power/zero crossing. Once the begin point has been detected, the present invention uses the cepstrum of the input signal to determine the endpoint of the sound in the signal. After both the beginning and ending of the sound are detected, the present invention uses vector quantization distortion to classify the sound as speech or noise.
REFERENCES:
patent: 4310721 (1982-01-01), Manley et al.
patent: 4348553 (1982-09-01), Baker et al.
patent: 4783804 (1988-11-01), Juang et al.
patent: 4821325 (1989-04-01), Martin et al.
patent: 4860355 (1989-08-01), Copperi
patent: 4903305 (1990-02-01), Gillick et al.
patent: 4945566 (1990-07-01), Mergel et al.
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5056150 (1991-10-01), Yu et al.
patent: 5091948 (1992-02-01), Kametani
patent: 5241619 (1993-08-01), Schwartz et al.
Fast Endpoint detection Algorithm for Isolated and Recognition in office environment.
Dermatas et al. ICASSP-91 p. 733-736 vol. 1 May 1991 Explicit Estimation of Speech boundaries.
Taboada et al. IEE proceedings-Science, Measurement and Technology p. 153-159 --May 1994.
"Speech Recognition, Neural Nets, And Brains" by George M. White, Jan. 1992.
"Large-Vocabulary Speaker-Independent Continuous Speech Recognition: The SPHINX System"by Kai-Fu Lee, Carnegie Mellon University, Pittsburgh, Pennsylvania, Apr. 1988.
"Digital Representations of Speech Signals" by Ronald W. Schafer and Lawrence R. Rabiner, The Institute of Electrical and Electronics Engineers, Inc., 1975, pp. 49-63.
"Speech Recognition by Machine: A Review" by D. Raj Reddy, IEEE Proceedings 64(4):502-531, Apr. 1976, pp. 8-35.
"Vector Quantization" by Robert M. Gray, IEEE, 1984, pp. 75-100.
Markel, J. D. and Gray, Jr., A. H., "Linear Production of Speech," Springer, Berlin Herdelberg New York, 1976.
Rabine, L., Sondhi, M. and Levison, S., "Note on the Properties of a Vector Quantizer for LPC Coefficients,"BSTJ, vol. 62, No. 8, Oct. 1983, pp. 2603-2615.
Linde, Y., Buzo, A., and Gray, R. M., "An Algorithm for a Vector Quantization," IEEE Trans. Commun., COM-28, No. 1 (Jan. 1980) pp. 84-95.
Bahl, I. R., et al., "Large Vocabulary National Language Continuous Speech Recognition," Proceeding of the IEEE CASSP 1989, Glasgow.
Gray, R. M., "Vector Quantization",IEEE ASSP Magazine, Apr. 1984, vol. 1, No. 2, p. 10.
Bahl, L. R., Baker, J. L., Cohen, P. S., Jelineck, F., Lewis, B. L, Mercer, R. L., "Recognition of a Continuously Read Natural Corpus", IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1978.
Schwartz, R., Chow, Y., Kimball, O., Roucos, S., Krasner, M., Makhoul, J., "Context-Dependent Modeling for Acoustic-Phonetic Recognition of Continuous Speech," IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1985.
Schwartz, R. M., Cow, X. L., Roucos, S., Krauser, M., Makhoul, J., "Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition," IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1984.
Alleva, F.Hon, H., Huang, X., Hwang, M., Rosenfeld, R., Weide, R., "Applying Sphinx II to DARPA Wall Street Journal CSR Task", Proc. of the DARPA Speech and NL Workshop, Feb. 1992, Morgan Kaufman Pub., San Mateo, CA.
Kai-Fu Lee, "Automatic Speech Recognition," Kluwer Academic Publishers, Boston/Dordrecht/London, 1989.
Chow Yen-Lu
Staats Erik P.
Apple Computer Inc.
Dorvil Richemond
MacDonald Allen R.
LandOfFree
Method and apparatus for detecting speech activity using cepstru does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for detecting speech activity using cepstru, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for detecting speech activity using cepstru will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2331175