Method and apparatus for detecting end points of speech activity

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

395 257, 395 259, 395 264, G10L 506

Patent

active

056921047

ABSTRACT:
A method and apparatus for detecting end points of speech activity in an input signal using spectral representation vectors performs beginning point detection using spectral representation vectors for the spectrum of each sample of the input signal and a spectral representation vector for the steady state portion of the input signal. The beginning point of speech is detected when the spectrum diverges from the steady state portion of the input signal. Once the beginning point has been detected, the spectral representation vectors of the input signal are used to determine the ending point of the sound in the signal. The ending point of speech is detected when the spectrum converges towards the steady state portion of the input signal. After both the beginning and ending of the sound are detected, vector quantization distortion can be used to classify the sound as speech or noise.

REFERENCES:
patent: 4310721 (1982-01-01), Manley et al.
patent: 4783804 (1988-11-01), Juang et al.
patent: 4821325 (1989-04-01), Martin et al.
patent: 4860355 (1989-08-01), Copperi
patent: 4945566 (1990-07-01), Mergel et al.
patent: 5056150 (1991-10-01), Yu et al.
patent: 5091948 (1992-02-01), Kametani
patent: 5241619 (1993-08-01), Schwartz et al.
Markel, J.D. and Gray, Jr., A.H., "Linear Production of Speech," Springer, Berlin Herdelberg New York, 1976.
Rabine, L., Sondhi, M. and Levison, S., "Note on the Properties of a Vector Quantizer for LPC Coefficients," BSTJ, vol. 62, No. 8, Oct. 1983, pp. 2603-2615.
Linde, Y., Buzo, A., and Gray, R.M., "An Algorithm for Vector Quantizer Design," IEEE Trans. Commun., COM-28, No. 1 (Jan. 1980) pp. 84-95.
Bahl, L.R., et al., "Large Vocabulary National Language Continuous Speech Recognition," Proceeding of the IEEE CASSP 1989, Glasgow.
Gray, R.M., "Vector Quantization", IEEE ASSP Magazine, Apr. 1984, vol. 1, No. 2, pp. 4-29.
Bahl, L.R., Baker, J.L., Cohen, P.S., Jelineck, F., Lewis, B.L., Mercer, R.L., "Recognition of a Continuously Read Natural Corpus" IEEE Int. Conf. on Acoustics Speech and Signal Processing, Apr. 1978.
Schwartz, R., Chow, Yl, Kimball, O., Rousos, S., Krasner, M., Makhoul, J., "Context-Dependent Modeling for Acoustic-Phonetic Recognition of Continuous Speech," IEEE Int. Conf. on Acoustics Speech and Signal Processing, Apr. 1985.
Schwartz, R.M., Chow, X.L., Roucos, S., Krauser, M., Makhoul, J., "Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition," IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1984.
Alleva, F. Hon, H., Huang, X., Hwang, M., Rosenfeld, R., Weide, R., "Applying Sphinx II to DARPA Wall Street Journal CSR Task", Proc. of the DARPA Speech and NL Workshop, Feb. 1992, Morgan Kaufman Pub., San Mateo, CA.
Kai-Fu Lee, "Automatic Speech Recognition," Kluwer Academic Publishers Boston/Dordrecht/London 1989.
Dermatas, et al., "Fast Endpoint Detection Algorithm For Isolated Word Recognition In Office Environment", IEEE, May 1991, pp. 733-736.
J. Taboada, et al., "Explicit Estimation of Speech Boundaries", IEEE, May 1991, pp. 153-159.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for detecting end points of speech activity does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for detecting end points of speech activity, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for detecting end points of speech activity will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2114509

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.