Patent
1997-03-14
1999-11-30
Wieland, Susan
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704/250, 704/233, 704/256, G10L 9/00
active
059959274
ABSTRACT:
A method and an apparatus for performing stochastic matching of a set of input test speech data with a corresponding set of training speech data. In particular, a set of input test speech feature information, generated from an input test speech utterance, is transformed so that its stochastic characteristics more closely match those of a corresponding set of training speech feature information. The corresponding set of training speech data may, for example, comprise training data generated from a speaker having the claimed identity of the speaker of the input test speech utterance. Specifically, in accordance with the present invention, a first covariance matrix representative of the stochastic characteristics of the input test speech feature information is generated from that information. A transformation is then performed on the input test speech feature information, based on the first covariance matrix and on a second covariance matrix representative of the stochastic characteristics of the training speech feature information. This transformation advantageously yields transformed input test speech feature information whose stochastic characteristics are more closely matched to those of the training speech feature information.
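The covariance-based transformation described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact form of the transformation, so everything specific here is an assumption made for illustration: the function names are hypothetical, the transform whitens the test features with the Cholesky factor of the test covariance and re-colors them with the Cholesky factor of the training covariance, and the shift to the training mean is an added convenience that the abstract does not mention.

```python
import math

def mean_vector(frames):
    """Per-dimension mean of a list of equal-length feature vectors."""
    n, d = len(frames), len(frames[0])
    return [sum(f[j] for f in frames) / n for j in range(d)]

def covariance(frames):
    """Covariance matrix of the feature vectors (normalized by N)."""
    n, d = len(frames), len(frames[0])
    mu = mean_vector(frames)
    cov = [[0.0] * d for _ in range(d)]
    for f in frames:
        c = [f[j] - mu[j] for j in range(d)]
        for i in range(d):
            for j in range(d):
                cov[i][j] += c[i] * c[j] / n
    return cov

def cholesky(a):
    """Lower-triangular L with L L^T = a; a must be symmetric positive definite."""
    d = len(a)
    L = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            s = a[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def forward_solve(L, b):
    """Solve L z = b for lower-triangular L by forward substitution."""
    z = [0.0] * len(b)
    for i in range(len(b)):
        z[i] = (b[i] - sum(L[i][k] * z[k] for k in range(i))) / L[i][i]
    return z

def match_to_training(test_frames, train_cov, train_mean):
    """Assumed transform: y = L_train * L_test^{-1} * (x - test_mean) + train_mean.

    The "first" covariance matrix is estimated from the test frames themselves;
    the "second" (train_cov) is supplied by the caller from training data.
    """
    d = len(train_mean)
    test_mean = mean_vector(test_frames)
    L_test = cholesky(covariance(test_frames))   # first covariance matrix
    L_train = cholesky(train_cov)                # second covariance matrix
    matched = []
    for f in test_frames:
        centered = [f[j] - test_mean[j] for j in range(d)]
        white = forward_solve(L_test, centered)  # identity covariance
        matched.append([
            sum(L_train[i][k] * white[k] for k in range(i + 1)) + train_mean[i]
            for i in range(d)
        ])
    return matched
```

Under these assumptions the transformed frames have exactly the training covariance and mean, up to floating-point error. The test covariance must be positive definite, which in practice requires more frames than feature dimensions. A symmetric matrix square root could replace the Cholesky factors; the choice here only keeps the sketch free of third-party dependencies.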
REFERENCES:
patent: 5167004 (1992-11-01), Netsch et al.
patent: 5473728 (1995-12-01), Luginbuhl et al.
patent: 5583951 (1996-12-01), Sirat et al.
patent: 5727124 (1998-03-01), Lee et al.
R. J. Mammone et al., "Robust Speaker Recognition," IEEE Signal Processing Magazine, Sep. 1996, pp. 58-71.
B. S. Atal, "Effectiveness of Linear Prediction Characteristics of the Speech Wave for Automatic Speaker Identification and Verification," J. Acoust. Soc. Am., vol. 55, No. 6, Jun. 1974, pp. 1304-1312.
B. S. Atal, "Automatic Recognition of Speakers From Their Voices," Proceedings of the IEEE, vol. 64, No. 4, Apr. 1976, pp. 460-475.
S. Furui, "Cepstral Analysis Technique For Automatic Speaker Verification," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, No. 2, Apr. 1981, pp. 254-272.
A. E. Rosenberg et al., "Cepstral Channel Normalization Techniques For HMM-Based Speaker Verification," ICSLP 94, Yokohama, pp. 1835-1838.
A. Sankar et al., "A Maximum-Likelihood Approach To Stochastic Matching For Robust Speech Recognition," IEEE Transactions on Speech And Audio Processing, vol. 4, No. 3, May 1996, pp. 190-202.
A. C. Surendran, "Maximum Likelihood Stochastic Matching Approach To Non-Linear Equalization For Robust Speech Recognition," Busch Campus, New Jersey, May 1996, pp. i-xiv, 1-101.
M. G. Rahim et al., "Signal Bias Removal by Maximum Likelihood Estimation For Robust Telephone Speech Recognition," IEEE Transactions On Speech And Audio Processing, vol. 4, No. 1, Jan. 1996, pp. 19-30.
D. Mansour et al., "A Family Of Distortion Measures Based Upon Projection Operation For Robust Speech Recognition," IEEE Transactions On Acoustics, Speech, And Signal Processing, vol. 37, No. 11, Nov. 1989, pp. 1659-1671.
S. Parthasarathy et al., "General Phrase Speaker Verification Using Sub-Word Background Models And Likelihood-Ratio Scoring," ICSLP 96, pp. 1-4.
Bogner, Robert E., "Pattern Recognition via Observation Correlations", IEEE Transactions On Pattern Analysis And Machine Intelligence, vol. PAMI-3, No. 2, Mar. 1981, New York, NY, US, pp. 128-133.
Mammone, R. J., Zhang, X., Ramachandran, R. P., "Robust Speaker Recognition: A Feature-based Approach", IEEE Signal Processing Magazine, vol. 13, No. 5, Sep. 1996, New York, NY, US, pp. 58-71.
Brown, Kenneth M.
Lucent Technologies, Inc.
Method for performing stochastic matching for use in speaker ver