Patent
1996-01-16
1997-07-15
MacDonald, Allen R.
G10L 9/00
Patent
active
056490577
ABSTRACT:
Speaker independent recognition of small vocabularies, spoken over the long distance telephone network, is achieved using two types of models, one type for defined vocabulary words (e.g., collect, calling-card, person, third-number and operator), and one type for extraneous input which ranges from non-speech sounds to groups of non-vocabulary words (e.g., "I want to make a collect call please"). For this type of key word spotting, modifications are made to a connected word speech recognition algorithm based on state-transitional (hidden Markov) models which allow it to recognize words from a pre-defined vocabulary list spoken in an unconstrained fashion. Statistical models of both the actual vocabulary words and the extraneous speech and background noises are created. A syntax-driven connected word recognition system is then used to find the best sequence of extraneous input and vocabulary word models for matching the actual input speech.
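The idea in the abstract, finding the best sequence of extraneous-input and vocabulary-word models for an utterance, can be illustrated with a toy Viterbi search. The sketch below is not the patented system: it assumes discrete symbols (characters) in place of acoustic feature frames, a single uniform "filler" state in place of trained extraneous-speech models, and a fixed grammar of filler, one key word, filler. The names `spot_keyword`, `ALPHABET`, `KEYWORDS`, `hit` and `stay` are hypothetical and chosen only for this illustration.

```python
import math

ALPHABET = "abcdefghijklmnopqrstuvwxyz "      # assumed symbol set (stand-in for acoustic frames)
KEYWORDS = ["collect", "person", "operator"]  # subset of the abstract's example vocabulary


def spot_keyword(frames, keywords, alphabet, hit=0.9, stay=0.5):
    """Viterbi search over the toy grammar: filler* -> one key word -> filler*.

    Each key word is a left-to-right chain with one state per character.
    Returns the key word on the best-scoring path, or None for empty input.
    """
    if not frames:
        return None
    lg = math.log
    filler_emit = lg(1.0 / len(alphabet))           # filler: uniform over all symbols
    miss = lg((1.0 - hit) / (len(alphabet) - 1))    # key word state sees the wrong symbol
    enter = lg((1.0 - stay) / len(keywords))        # leave the leading filler for a key word

    def emit(state, sym):
        if state in ("pre", "post"):
            return filler_emit
        word, i = state
        return lg(hit) if word[i] == sym else miss

    # scores maps state -> (best log score, key word on that path)
    scores = {"pre": (lg(stay) + emit("pre", frames[0]), None)}
    for w in keywords:
        scores[(w, 0)] = (enter + emit((w, 0), frames[0]), w)

    for sym in frames[1:]:
        new = {}

        def relax(dst, cand, label):
            # keep only the best-scoring way of reaching dst at this frame
            if dst not in new or cand > new[dst][0]:
                new[dst] = (cand, label)

        for src, (sc, label) in scores.items():
            if src == "pre":
                relax("pre", sc + lg(stay) + emit("pre", sym), None)
                for w in keywords:
                    relax((w, 0), sc + enter + emit((w, 0), sym), w)
            elif src == "post":
                relax("post", sc + emit("post", sym), label)  # self-loop, probability 1
            else:
                w, i = src
                relax(src, sc + lg(stay) + emit(src, sym), w)
                nxt = (w, i + 1) if i + 1 < len(w) else "post"
                relax(nxt, sc + lg(1.0 - stay) + emit(nxt, sym), w)
        scores = new

    # a valid path must end in the trailing filler or in a key word's last state
    finals = [v for s, v in scores.items()
              if s == "post" or (s != "pre" and s[1] == len(s[0]) - 1)]
    return max(finals)[1] if finals else None


print(spot_keyword("i want to make a collect call please", KEYWORDS, ALPHABET))
```

Because the filler states absorb everything outside the key word, the caller's surrounding words do not need to be in the vocabulary; only the span that matches a key word model well is labeled. A real recognizer would replace the character match with likelihoods from trained acoustic models.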
REFERENCES:
patent: Re32012 (1985-10-01), Pirz et al.
patent: 4481593 (1984-11-01), Bahler
patent: 4713777 (1987-12-01), Klovstad et al.
patent: 4783804 (1988-11-01), Juang et al.
patent: 4802231 (1989-01-01), Davis
patent: 4811399 (1989-03-01), Landell et al.
patent: 4827521 (1989-05-01), Bahl et al.
patent: 4829577 (1989-05-01), Kuroda et al.
patent: 4837831 (1989-06-01), Gillick et al.
patent: 4914703 (1990-04-01), Gillick
patent: 4977599 (1990-12-01), Bahl et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5218668 (1993-06-01), Higgins et al.
patent: 5440662 (1995-08-01), Sukkar
patent: 5452397 (1995-09-01), Ittycheriah et al.
Rohlicek et al, "Continuous Hidden Markov Modeling For Speaker-Independent Word Spotting", IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, 627-630 (May 1988).
Wilpon et al., "Isolated Word Recognition Over the DDD Telephone Network--Results of Two Extensive Field Studies," IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, 55-58 (Apr. 1988).
Markowitz, "Keyword Spotting in Speech", AI Expert, pp. 21-25, Oct. 1994.
Wilpon et al., "Automatic Recognition of Keywords in Unconstrained Speech Using Hidden Markov Models", IEEE Trans. on Acoustics Speech and Signal Proc., vol. 38, No. 11, pp. 1870-1878 Nov. 1990.
"Detecting and Locating Key Words in Continuous Speech Using Linear Predictive Coding," by Christiansen and Rushforth, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP 25 No. 5, pp. 362-367, Oct. 1977.
"Keyword Recognition Using Template Concatenation," by Higgins and Wohlford, IEEE Int. Conf. Acous. Speech, and Signal Processing, pp. 1233-1236, Mar. 1985.
"Application of Hidden Markov Models to Automatic Speech Endpoint Detection," by Wilpon and Rabiner, Computer Speech and Language, vol. 2, 3/4 pp. 321-341, Dec. 1987.
Digital Processing of Speech Signals, L. R. Rabiner et al., Prentice Hall, pp. 356-372 and 398-401 (1978).
"The Frequency Analysis of Time Series for Echoes," Proc. Symp. on Time Series Analysis, Bo Bogert et al., Ch. 15, pp. 209-243, 1963.
Digital Processing of Speech Signals, by L. R. Rabiner et al., Prentice Hall, p. 121 (1978).
"The Use of Bandpass Filtering in Speech Recognition," IEEE Transactions on Acoustics, Speech and Signal Processing, by B. Juang et al., ASSP 35, no. 7, pp. 947-954, Jul. 1987.
"On the Use of Instantaneous and Transitional Spectral Information in Speaker Recognition," by F. K. Soong et al., IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP 36, No. 6, pp. 871-879, Jun. 1988.
"High Performance Connected Digit Recognition Using Hidden Markov Models," by L. R. Rabiner et al., IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 119-122, Apr. 1988.
"A Network-Based Frame Synchronous Level Building Algorithm for Connected Word Recognition," by C-H. Lee et al., IEEE Int. Conf. Acous. Speech and Sig. Processing, vol. 1, pp. 410-413, Apr. 1988.
"A Segmental K-means Training Procedure for Connected Word Recognition Based on Whole Word Reference Patterns," by L. R. Rabiner et al., AT&T Technical Journal, vol. 65, No. 3, pp. 21-31, May 1986.
Lee Chin-Hui
Rabiner Lawrence Richard
Wilpon Jay Gordon
Chawan Vijay B.
Lucent Technologies - Inc.
MacDonald Allen R.
Slusky Ronald D.
Speech recognition employing key word modeling and non-key word