Patent
1994-11-30
1998-03-17
MacDonald, Allen R.
G10L 5/06
active
057296561
ABSTRACT:
A method for estimating the probability of phone boundaries and the accuracy of the acoustic modelling, used to reduce the search space in a speech recognition system. The accuracy of the acoustic modelling is quantified by the rank of the correct phone. The system includes a microphone for converting an utterance into an electrical signal, which is processed by an acoustic processor and a label match that finds the best-matched acoustic label prototype. A probability distribution on phone boundaries is produced for every time frame using a first decision tree. These probabilities are compared to a threshold, and some time frames are identified as boundaries between phones. An acoustic score is computed for all phones between every given pair of hypothesized boundaries, and the phones are ranked on the basis of this score. A second decision tree is traversed for every time frame to obtain the worst-case rank of the correct phone at that time, and a short list of allowed phones is made for every time frame. A fast acoustic word match processor matches the label string from the acoustic processor to produce an utterance signal which includes at least one word. From recognition candidates produced by the fast acoustic match and the language model, the detailed acoustic match matches the label string from the acoustic processor against acoustic word models and outputs a word string corresponding to an utterance.
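The short-list step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The two decision trees and the acoustic scorer are replaced by precomputed inputs (`boundary_probs`, `worst_rank`, and a caller-supplied `score_segment`), all of which are hypothetical names. The sketch also simplifies by scoring only adjacent boundary pairs, by treating the utterance start and end as boundaries, and by assuming that a higher score means a better match.

```python
from typing import Callable, Dict, List, Sequence


def find_boundaries(boundary_probs: Sequence[float], threshold: float) -> List[int]:
    # Frames whose boundary probability exceeds the threshold are treated
    # as hypothesized phone boundaries.
    return [t for t, p in enumerate(boundary_probs) if p > threshold]


def rank_phones(scores: Dict[str, float]) -> List[str]:
    # Assumption: a higher acoustic score means a better match.
    return sorted(scores, key=lambda phone: scores[phone], reverse=True)


def shortlist_per_frame(
    boundary_probs: Sequence[float],
    threshold: float,
    score_segment: Callable[[int, int], Dict[str, float]],
    worst_rank: Sequence[int],
) -> List[List[str]]:
    n = len(boundary_probs)
    # Assumption: the utterance start and end always count as boundaries.
    cuts = sorted({0, n, *find_boundaries(boundary_probs, threshold)})
    allowed: List[List[str]] = [[] for _ in range(n)]
    # Simplification: only adjacent boundary pairs are scored here.
    for start, end in zip(cuts, cuts[1:]):
        ranking = rank_phones(score_segment(start, end))
        for t in range(start, end):
            # Keep the phones ranked at or above the worst-case rank
            # predicted for frame t (stand-in for the second decision tree).
            allowed[t] = ranking[: worst_rank[t]]
    return allowed


def toy_scores(start: int, end: int) -> Dict[str, float]:
    # Hypothetical scorer: fixed scores that depend only on the segment start.
    if start == 0:
        return {"AA": 0.7, "IY": 0.2, "S": 0.1}
    return {"S": 0.6, "AA": 0.3, "IY": 0.1}


probs = [0.1, 0.2, 0.9, 0.1, 0.3]   # stand-in for first decision tree output
ranks = [1, 2, 2, 1, 3]             # stand-in for second decision tree output
result = shortlist_per_frame(probs, 0.5, toy_scores, ranks)
print(result)
```

With the toy inputs, frame 2 is the only hypothesized boundary, so frames 0 to 1 share one ranking and frames 2 to 4 share another. Each frame then keeps only as many top-ranked phones as its worst-case rank allows, which is how the later match stages see a smaller candidate set.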
REFERENCES:
patent: 4718094 (1988-01-01), Bahl et al.
patent: 4741036 (1988-04-01), Bahl et al.
patent: 4773093 (1988-09-01), Higgins et al.
patent: 4803729 (1989-02-01), Baker
patent: 4805219 (1989-02-01), Baker et al.
patent: 4813074 (1989-03-01), Marcus
patent: 4852173 (1989-07-01), Bahl et al.
patent: 4977599 (1990-12-01), Bahl et al.
patent: 5027408 (1991-06-01), Kroeker et al.
patent: 5144671 (1992-09-01), Mazor et al.
patent: 5222146 (1993-06-01), Bahl et al.
patent: 5233681 (1993-08-01), Bahl et al.
patent: 5263117 (1993-11-01), Nadas et al.
patent: 5280562 (1994-01-01), Bahl et al.
patent: 5293584 (1994-03-01), Brown et al.
patent: 5390278 (1995-02-01), Gupta et al.
L.R. Bahl et al., "Faster Acoustic Match Computation", IBM Technical Disclosure Bulletin, vol. 23, No. 4, Sep. 1980, pp. 1718-1719.
P.S. Gopalakrishnan et al., "Channel-Bank-Based Thresholding to Improve Search Time in the Fast-Match", IBM Technical Disclosure Bulletin, vol. 37, No. 02A, Feb. 1994, pp. 113-114.
V. Algazi et al., "Transform Representation of the Spectra of Acoustic Speech Segments With Applications I: General Approach and Application to Speech Recognition", IEEE Transactions on Speech and Audio Processing, vol. 1, No. 2, Apr. 1993, pp. 180-195.
L. Bahl, "A Fast Approximate Acoustic Match for Large Vocabulary Speech Recognition", IEEE Transactions on Speech and Audio Processing, vol. 1, No. 1, Jan. 1993, pp. 59-67.
Nahamoo David
Padmanabhan Mukund
International Business Machines Corporation
MacDonald Allen R.
Mattson Robert