Electrical audio signal processing systems and devices – One-way audio signal program distribution – Public address system
Patent
1987-04-03
1989-02-07
Roskoski, Bernard
Electrical audio signal processing systems and devices
One-way audio signal program distribution
Public address system
3645135, G10L 500
Patent
active
048037290
ABSTRACT:
Smoothed frame labeling associates phonetic frame labels with a given speech frame as a function of (a) the closeness with which the given frame compares to each of a plurality of acoustic models, (b) which frame labels correspond with a neighboring frame, and (c) transition probabilities which indicate, for the frame labels associated with the neighboring frame, which frame labels are probably associated with the given frame. The smoothed frame labeling is used to divide the speech into segments of frames having the same class of labels. The invention represents words as a collection of known diphone models, each of which models the sound before and after a boundary between segments derived by the smoothed frame labeling. At recognition time, the speech is divided into segments by smoothed frame labeling; diphone models are derived for each boundary between the resulting segments; and the resulting diphone models are compared against the known diphone models to determine which of the known diphone models match the segment boundaries in the speech. Then a combined-displaced-evidence method is used to determine which words occur in the speech. This method detects which acoustic patterns, in the form of the known diphone models, match various portions of the speech. In response to each such match, it associates with the speech an evidence score for each vocabulary word in which that pattern is known to occur. It displaces each such score from the location of its associated matched pattern by the known distance between that pattern and the beginning of the score's word. Then all the evidence scores for a word located in a given portion of the speech are combined to produce a score which indicates the probability of that word starting in that portion of the speech. This score is combined with a score produced by comparing a histogram from a portion of the speech against a histogram of each word. The resulting combined score determines whether a given word should undergo a more detailed comparison against the speech to be recognized.
REFERENCES:
patent: 4718092 (1988-01-01), Klovstad
patent: 4718094 (1988-01-01), Bahl et al.
"Stochastic Modeling for Automatic Speech Understanding", Baker, Speech Recognition, pp. 522-542, Academic Press 1975.
"Linear Predictive Hidden Markou Models and the Speech Signal", Poritz, pp. 1291-1294, IEEE Int. Conf. Acoustics, Speech, and Signal Processing.
"The 1976 Modular Acoustic Processor (MAP)" Silverman et al., pp. 15-20.
Conference Record, 1976 IEEE Int. Conf. Acoustics, Speech & Signal Processing "Motivation and Overview of Speechlis" Woods, IEEE Trans. Acoust. Speech & Signal Process, vol. ASSP-23, pp. 2-10, 2/75.
"Orginization of Hearsay II Speech Understanding System", Lesser et al., IEEE Trans. Acoust. Speech, and Signal Proc.,vol. ASSP-23, pp. 11-24 2/75.
"The HW/Mo Speech Understanding System", Wolf et al., IEEE Int. Conf. Acoust. Speech & Signal Process, May 9-11, 1977, pp. 784-787.
Dragon Systems, Inc.
Porter Edward W.
Roskoski Bernard
LandOfFree
Speech recognition method does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition method, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition method will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1089900