Data processing: speech signal processing – linguistics – language – Modification of at least one characteristic of speech waves – Transformation of speech into a nonaudible representation,...
Patent
1997-08-11
1999-10-19
Teska, Kevin J.
Data processing: speech signal processing, linguistics, language
Modification of at least one characteristic of speech waves
Transformation of speech into a nonaudible representation,...
704231, 704236, G06F 9455
Patent
active
059702397
ABSTRACT:
Method for performing acoustic model estimation to optimize classification accuracy on speaker derived feature vectors with respect to a plurality of classes corresponding to phones to which a plurality of acoustic models respectively correspond comprises: (a) initializing an acoustic model for each phone; (b) evaluating the merit of the acoustic model initialized for each phone utilizing an objective function having a two component discriminant measure capable of characterizing each phone whereby a first component is defined as a probability that the model for the phone assigns to feature vectors from the phone and a second component is defined as a probability that the model for the phone assigns to feature vectors from other phones; (c) adapting the model for selected phones so as to increase the first component for the phone or decrease the second component for the phone, the adapting step yielding a new model for each selected phone; (d) evaluating the merit of the new models for each phone adapted in step (c) utilizing the two component measure; (e) comparing results of the evaluation of step (b) with results of the evaluation of step (d) for each phone, and if the first component has increased or the second component has decreased, the new model is kept for that phone, else the model originally initialized is kept; (f) estimating parameters associated with each model kept for each phone in order to optimize the function; and (g) evaluating termination criterion to determine if the parameters of the models are optimized.
REFERENCES:
patent: 5195167 (1993-03-01), Bahl et al.
patent: 5222146 (1993-06-01), Bahl et al.
patent: 5455889 (1995-10-01), Bahl et al.
patent: 5497447 (1996-03-01), Bahl et al.
patent: 5615299 (1997-03-01), Bahl et al.
patent: 5787394 (1998-07-01), Bahl et al.
L.R. Bahl, P.F. Brown, P.V. deSouza, R.L. Mercer in "Maximum Mutual Information Estimation of Hidden Markov Model Parameters for Speech Recognition", Proceedings of the ICASSP, pp. 49-52, 1986.
B.H. Juang, W. Chou, C.H. Lee in "Minimum Classification Error Rate Methods for Speech Recognition", IEEE Trans. on Speech and Audio Processing, vol. 5, pp. 257-265, May 1997.
A.P. Dempster, N.M. Laird, D.B. Rubin in "Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm", Journal of the Royal Statistical Society (B), vol. 39, No. 1, pp. 1-38, 1979.
R.O. Duda and P.E. Hart in "Pattern Classification and Scene Analysis", Wiley, New York, 1973.
R. Lippman in "Pattern Classification Using Neural Networks", IEEE Communications Magazine, pp. 11:47-64, 1989.
Y. Normandin in "Optimal Splitting of HMM Gaussian Mixture Components with MMIE Training", Proceedings of the ICASSP, pp. 449-452, 1995.
A.J. Viterbi in "Error Bounds for Convolutional Codes and An Asymptotically Optimum Decoding Algorithm", IEEE Trans. on Information Theory, vol. IT-13, pp. 260-269, Apr. 1967.
Bahl Lalit Rai
Padmanabhan Mukund
Broda Samuel
International Business Machines - Corporation
Teska Kevin J.
LandOfFree
Apparatus and method for performing model estimation utilizing a does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Apparatus and method for performing model estimation utilizing a, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and method for performing model estimation utilizing a will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2067233