1994-12-30
1997-06-10
MacDonald, Allen R.
395 257, 395 211, 395 241, G10L 900
Patent
active
056384874
ABSTRACT:
A scheme for recognizing speech represented by a sequence of frames of acoustic events separated by boundaries, according to which the frames of speech are processed to assign to received frames respective boundary probabilities representative of the degree to which the frames of speech correspond to stored representations of boundaries between acoustic events. The assigned boundary probabilities are used in subsequent processing steps to enhance recognition of speech. The assignment of boundary probabilities and further adjustments of the assigned probabilities are preferably conducted by an artificial neural network (ANN).
REFERENCES:
patent: 3943295 (1976-03-01), Martin et al.
patent: 4349700 (1982-09-01), Pirz et al.
patent: 4665548 (1987-05-01), Kahn
patent: 4783803 (1988-11-01), Baker et al.
patent: 4803729 (1989-02-01), Baker
patent: 4821325 (1989-04-01), Martin et al.
patent: 4945566 (1990-07-01), Mergel et al.
patent: 5285522 (1994-02-01), Mueller
patent: 5305422 (1994-04-01), Junqua
patent: 5479563 (1995-12-01), Yamaguchi
Cole et al.("Speaker-Independent Recognition of spoken english letters", Neural Networks, 1990 International Conference, Jan. 1990, vol. 11, pp. 45-51).
Farrell et al.,(Speaker recognition using neural networks and conventional classifiers", IEEE transactions on Speech and Audio Processing, Jan. 1994, vol. 11, pp. 194-205).
Leung et al., "A Comparative Study of Signal Representations and Classification Techniques for Speech Recognition", 1993 IEEE, pp. 680-683.
Benjamin Chigier, "Phonetic Classification on Wide-Band and Telephone Quality Speech", 1992.
Chigier et al., "The Effects of Signal Representations, Phonetic Classification Techniques, and the Telephone Network", ICSLP 1992.
Pitrelli et al., "Multiple-Level Evaluation of Speech Recognition Systems", ICSLP 1992.
Chigier et al., "Are Laboratory Databases Appropriate for Training and Testing Telephone Speech Recognizers?", ICSLP, 1990, pp, 1017-1020.
Chigier et al., "Broad Class Network Generation Using a Combination of Rules and Statistics for Speaker Independent Continuous Speech", 1988 IEEE, pp. 449-452.
Cole et al., "The C-MU Phonetic Classification System", 1986 IEEE, pp. 2255-2257.
Thomas et al., "The Sensitivity of Speech Recognisers to Speaker Variability and Speaker Variation (Before Dec. 1993).
Chigier et al., "Analysis of Two Algorithms for Telephone Speech Recognition" (Before Dec. 1993).
Digalakis et al., "Fast Algorithms for Phone Classification and Recognition Using Segment-Based Models"(Before Dec. 1993).
Chigier, "Phonetic Classification on Wide-Band and Telephone Quality Speech," Proceedings 5th Workshop, presented at Arden House, NY, 1992.
Mandelbaum, "SPEECH: Just say the word," IEEE Spectrum, 30, Feb. 1994.
Hammerstrom, "Working with neural networks," IEEE Spectrum, 46-53, Jul. 1993.
Hammerstrom, "Neural networks at work," IEEE Spectrum, 26-32, Jun. 1993.
Lippmann, "An Introduction to Computing with Neural Nets," IEEE ASSP 4:4-22, 1987.
Hush et al., "Progress in Supervised Neural Networks," IEEE Signal Processing 8-39, Jan. 1993.
Waibel et al., "Phoneme Recognition Using Time-Delay Neural Networks," IEEE Transactions on Acoustics, Speech, and Signal Processing 37:328-339, 1989.
Tank et al., "Neural computation by concentrating information in time," Proc. Natl Acad. Sci. USA 84:1896-1900, 1987.
Levin, "Word Recognition Using Hidden Control Neural Architecture," ICASSP 90 Proceedings, 1:433-436, 1990.
Chawan Vijay B.
MacDonald Allen R.
PureSpeech, Inc.
LandOfFree
Automatic speech recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Automatic speech recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automatic speech recognition will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-771762