Apparatus and method for performing model estimation utilizing a

Data processing: speech signal processing – linguistics – language – Modification of at least one characteristic of speech waves – Transformation of speech into a nonaudible representation,...

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704231, 704236, G06F 9455

Patent

active

059702397

ABSTRACT:
Method for performing acoustic model estimation to optimize classification accuracy on speaker derived feature vectors with respect to a plurality of classes corresponding to phones to which a plurality of acoustic models respectively correspond comprises: (a) initializing an acoustic model for each phone; (b) evaluating the merit of the acoustic model initialized for each phone utilizing an objective function having a two component discriminant measure capable of characterizing each phone whereby a first component is defined as a probability that the model for the phone assigns to feature vectors from the phone and a second component is defined as a probability that the model for the phone assigns to feature vectors from other phones; (c) adapting the model for selected phones so as to increase the first component for the phone or decrease the second component for the phone, the adapting step yielding a new model for each selected phone; (d) evaluating the merit of the new models for each phone adapted in step (c) utilizing the two component measure; (e) comparing results of the evaluation of step (b) with results of the evaluation of step (d) for each phone, and if the first component has increased or the second component has decreased, the new model is kept for that phone, else the model originally initialized is kept; (f) estimating parameters associated with each model kept for each phone in order to optimize the function; and (g) evaluating termination criterion to determine if the parameters of the models are optimized.

REFERENCES:
patent: 5195167 (1993-03-01), Bahl et al.
patent: 5222146 (1993-06-01), Bahl et al.
patent: 5455889 (1995-10-01), Bahl et al.
patent: 5497447 (1996-03-01), Bahl et al.
patent: 5615299 (1997-03-01), Bahl et al.
patent: 5787394 (1998-07-01), Bahl et al.
L.R. Bahl, P.F. Brown, P.V. deSouza, R.L. Mercer in "Maximum Mutual Information Estimation of Hidden Markov Model Parameters for Speech Recognition", Proceedings of the ICASSP, pp. 49-52, 1986.
B.H. Juang, W. Chou, C.H. Lee in "Minimum Classification Error Rate Methods for Speech Recognition", IEEE Trans. on Speech and Audio Processing, vol. 5, pp. 257-265, May 1997.
A.P. Dempster, N.M. Laird, D.B. Rubin in "Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm", Journal of the Royal Statistical Society (B), vol. 39, No. 1, pp. 1-38, 1979.
R.O. Duda and P.E. Hart in "Pattern Classification and Scene Analysis", Wiley, New York, 1973.
R. Lippman in "Pattern Classification Using Neural Networks", IEEE Communications Magazine, pp. 11:47-64, 1989.
Y. Normandin in "Optimal Splitting of HMM Gaussian Mixture Components with MMIE Training", Proceedings of the ICASSP, pp. 449-452, 1995.
A.J. Viterbi in "Error Bounds for Convolutional Codes and An Asymptotically Optimum Decoding Algorithm", IEEE Trans. on Information Theory, vol. IT-13, pp. 260-269, Apr. 1967.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Apparatus and method for performing model estimation utilizing a does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Apparatus and method for performing model estimation utilizing a, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and method for performing model estimation utilizing a will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2067233

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.