Reexamination Certificate
2003-12-03
2008-11-25
Vo, Huyen X. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S243000, C704S244000
active
07457745
ABSTRACT:
A system, method, and computer program product for fast on-line automatic speaker/environment adaptation, suitable for speech/speaker recognition, are presented. The system comprises a computer system including a processor, a memory coupled with the processor, an input coupled with the processor for receiving acoustic signals, and an output coupled with the processor for outputting recognized words or sounds. The system includes a model-adaptation system and a recognition system, configured to accurately and efficiently recognize, on-line, distorted sounds or words spoken with different accents in the presence of randomly changing environmental conditions. The model-adaptation system quickly adapts the standard acoustic training models available on audio recognition systems by incorporating distortion parameters representative of the changing environmental conditions or the speaker's accent. Because it adapts models that are already available to the new environment, the system does not need separate adaptation training data.
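To make the idea of "incorporating distortion parameters" into existing acoustic models concrete, the following is a minimal sketch and not the patented method. It assumes the distortion is a single additive bias vector shared by all Gaussian components of a diagonal-covariance mixture model, and estimates that bias by maximum likelihood directly from the incoming frames, in the spirit of the stochastic-matching work listed in the references. The function names (`estimate_bias`, `adapt_means`) and the bias-only distortion model are assumptions made for illustration.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at frame x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def estimate_bias(frames, weights, means, variances, n_iter=5):
    """EM estimate of one additive bias shared by all mixture components.

    Assumption (hypothetical model): each adapted mean is mean + bias.
    Only the incoming frames are used; no separate adaptation set.
    """
    dim = len(means[0])
    bias = [0.0] * dim
    for _ in range(n_iter):
        num = [0.0] * dim
        den = [0.0] * dim
        for x in frames:
            # E-step: component posteriors under the currently adapted model.
            logp = [math.log(w) + log_gauss(x, [m + b for m, b in zip(mu, bias)], var)
                    for w, mu, var in zip(weights, means, variances)]
            top = max(logp)
            post = [math.exp(lp - top) for lp in logp]
            total = sum(post)
            # Accumulate the variance-weighted statistics for the M-step.
            for g, mu, var in zip(post, means, variances):
                g /= total
                for d in range(dim):
                    num[d] += g * (x[d] - mu[d]) / var[d]
                    den[d] += g / var[d]
        # M-step: closed-form maximum-likelihood bias per dimension.
        bias = [n / d for n, d in zip(num, den)]
    return bias

def adapt_means(means, bias):
    """Shift every component mean by the estimated bias."""
    return [[m + b for m, b in zip(mu, bias)] for mu in means]
```

With a single component the estimate reduces to the average offset between the frames and the model mean, which is an easy sanity check; richer distortion models (for example an affine transform per component) would need different update equations.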
REFERENCES:
patent: 6766295 (2004-07-01), Murveit et al.
A. Sankar, et al., “A maximum likelihood approach to stochastic matching for robust speech recognition,” IEEE TSAP, vol. 4, pp. 190-202, May 1996.
R. C. Rose, et al., “Integrated models of signal and background with application to speaker identification in noise,” IEEE TSAP, vol. 2, No. 2, pp. 245-257, Apr. 1994.
Hui Jiang, et al., “A robust compensation strategy for extraneous acoustic variations in spontaneous speech recognition,” IEEE TSAP, vol. 10, No. 1, pp. 9-17, Jan. 2002.
J. McDonough, T. Schaaf, and A. Waibel, “On maximum mutual information speaker-adapted training,” ICASSP 2002, vol. 1, pp. 601-604, 2002.
B. Zhou, et al., “Rapid speaker adaptation using multi-stream structural maximum likelihood eigenspace mapping,” ICASSP 2002, vol. 4, pp. 4166-4169, 2002.
J-T Chien, “Online unsupervised learning of hidden Markov models for adaptive speech recognition,” IEE Proceedings - Vision, Image and Signal Processing, vol. 148, No. 5, pp. 315-324, Oct. 2001.
S. Wang, et al., “Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation,” IEEE TSAP, vol. 9, No. 6, Sep. 2001.
Burns, Ron
Iseli, Markus
Kadambe, Shubha
HRL Laboratories LLC
Tope-McKay & Associates
Vo, Huyen X.