Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1995-11-07
1998-08-25
MacDonald, Allen R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704208, 704207, 704231, 704257, G10L 506
Patent
active
057992762
ABSTRACT:
Knowledge based speech recognition apparatus and methods are provided for translating an input speech signal to text. The speech recognition apparatus captures an input speech signal, segments it based on the detection of pitch period, and generates a series of hypothesized acoustic feature vectors for the input speech signal that characterizes the signal in terms of primary acoustic events, detectable vowel sounds and other acoustic features. The apparatus and methods employ a largely speaker-independent dictionary based upon the application of phonological and phonetic/acoustic rules to generate acoustic event transcriptions against which the series of hypothesized acoustic feature vectors are compared to select word choices. Local and global syntactic analysis of the word choices is provided to enhance the recognition capability of the methods and apparatus.
REFERENCES:
patent: 3700815 (1972-10-01), Doddington et al.
patent: 4718094 (1988-01-01), Bahl et al.
patent: 4748670 (1988-05-01), Bahl et al.
patent: 4837831 (1989-06-01), Gillick et al.
patent: 4903305 (1990-02-01), Gillick et al.
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5033087 (1991-07-01), Bahl et al.
patent: 5040127 (1991-08-01), Gerson
patent: 5170432 (1992-12-01), Hackbarth et al.
patent: 5202952 (1993-04-01), Gillick et al.
patent: 5384892 (1995-01-01), Strong
patent: 5390279 (1995-02-01), Strong
patent: 5526463 (1996-06-01), Gillick et al.
Cohen, Michael Harris "Phonological Structures for Speech Recognition", PhD Dissertation for Department of Electrical Engineering and Computer Science, University of California at Berkeley, Apr. 1989.
Cole, Ronald and Fantz, Mark, "Spoken Letter Recognition", pp. 385-390.
Hanson, Helen M. et al., "A System for Finding Speech Formants and modulations via Energy Separation", Jul. 3, 1993.
Mori, de R., Nato Asi Series, vol. F45, "Knowledge-Based Computer Recognition of Speech" pp. 271-290, 1988.
Neuburg, Edward, Darpa, "Speech Tutorial" pp. 53-71, Feb. 1989.
Oppenheim, Alan U. and Schafer, Ronald W., Discrete-Time Signal Processing, pp. 444-450 and 723-727, 1989.
Quinnell, Richard A., EDN, "Speech Reconition, No Longer A Dream But Still A Challenge", pp. 41-46, Jan. 19, 1995.
Rabiner, L.R. and Young, B.H., Nato Asi Series, vol. F75, "Hidden Markov Models for Speech Recognition-Strengths and Limitations", 1992.
Zue, Victor W., Knowledge-Based Approaches, "The Use of Speech Knowledge in Automatic Speech Recognition", pp. 200-213, 1985.
Zue, Victor, et al., The MIT Summit Speech Recognition System; A Progress Report, Proceedings, Speech & Natural Language Workshop, pp. 179-189, Oct. 1989, DARPA, published by Morgan-Kaufmann Publishers, Inc., San Mateo, CA.
Zue, Victor, et al., Recent Progress on the Summit System, Proceedings, Speech & Natural Language Workshop, pp. 380-384, Jun. 1990, DARPA, published by Morgan-Kaufmann Publishers, Inc., San Mateo, CA.
Zue, Victor, The Summit Speech Recognition System: Phonological Modelling and Lexical Access, ICASSP-90, vol. II, 1990 IEEE International Conference on Acoustics, Speech & Signal Processing, pp. 49-52, Apr. 27-30, 1993, Minneapolis, Conference Proceedings.
Rabiner et al. "Fundamentals of Speech Recognition." Prentice Hall, pp. 469-479, 1993.
Arlazarov Vladimir
Bogdanov Dimitri
Finkelstein Yuri
Ivanov Andrey
Kaminsky Jacob
Accent Incorporated
Collins Alphonso A.
MacDonald Allen R.
Pisano Nicola A.
LandOfFree
Knowledge-based speech recognition system and methods having fra does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Knowledge-based speech recognition system and methods having fra, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Knowledge-based speech recognition system and methods having fra will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-46610