Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2007-08-07
2007-08-07
Opsasnick, Michael N (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S228000
Reexamination Certificate
active
11059036
ABSTRACT:
A method and apparatus are provided for reducing noise in a training signal and/or test signal. The noise reduction technique uses a stereo signal formed of two channel signals, each channel containing the same pattern signal. One of the channel signals is “clean” and the other includes additive noise. Using feature vectors from these channel signals, a collection of noise correction and scaling vectors is determined. When a feature vector of a noisy pattern signal is later received, it is multiplied by the best scaling vector for that feature vector and the best correction vector is added to the product to produce a noise reduced feature vector. Under one embodiment, the best scaling and correction vectors are identified by choosing an optimal mixture component for the noisy feature vector. The optimal mixture component being selected based on a distribution of noisy channel feature vectors associated with each mixture component.
REFERENCES:
patent: 4718094 (1988-01-01), Bahl et al.
patent: 4918735 (1990-04-01), Morito et al.
patent: 5012519 (1991-04-01), Adlersberg et al.
patent: 5390278 (1995-02-01), Gupta et al.
patent: 5583968 (1996-12-01), Trompf
patent: 5590242 (1996-12-01), Juang et al.
patent: 5604839 (1997-02-01), Acero et al.
patent: 5758022 (1998-05-01), Trompf et al.
patent: 5924065 (1999-07-01), Eberman et al.
patent: 5950157 (1999-09-01), Heck et al.
patent: 6026359 (2000-02-01), Yamaguchi et al.
patent: 6067517 (2000-05-01), Bahl et al.
patent: 6092045 (2000-07-01), Stubley et al.
patent: 6202047 (2001-03-01), Ephraim et al.
patent: 6292775 (2001-09-01), Holmes
patent: 6301561 (2001-10-01), Saul
patent: 6401064 (2002-06-01), Saul
patent: 6446038 (2002-09-01), Bayya et al.
patent: 6490555 (2002-12-01), Yegnanarayanan et al.
patent: 6691091 (2004-02-01), Cerisara et al.
patent: 6778954 (2004-08-01), Kim et al.
patent: 0 301 199 (1989-02-01), None
patent: 0 694 906 (1996-01-01), None
U.S. Appl. No. 10/116,792, filed Apr. 5, 2002, Li Deng et al.
U.S. Appl. No. 10/117,142, filed Apr. 5, 2002, Droppo et al.
U.S. Appl. No. 09/688,950, filed Oct. 16, 2000, Li Deng et al.
U.S. Appl. No. 09/688,764, filed Oct. 16, 2000, Li Deng et al.
“HMM Adaptation Using Vector Taylor Series for Noisy Speech Recognition,” Alex Acero, et al., Proc. ICSLP, vol. 3, 2000, pp. 869-872.
“Sequential Noise Estimation with Optimal Forgetting for Robust Speech Recognition,” Mohomed Afify, et al., Proc. ICASSP, vol. 1, 2001, pp. 229-232.
“High-Performance Robust Speech Recognition Using Stereo Training Data,” Li Deng, et al., Proc. ICASSP, vol. 1, 2001, pp. 301-304.
“ALGONQUIN: Iterating Laplace's Method to Remove Multiple Types of Acoustic Distortion for Robust Speech Recognition,” Brendan J. Frey, et al., Proc. Eurospeech, Sep. 2001, Aalborg, Denmark.
“Nonstationary Environment Compensation Based on Sequential Estimation,” Nam Soo Kim, IEEE Signal Processing Letters, vol. 5, 1998, pp. 57-60.
“On-line Estimation of Hidden Markov Model Parameters Based on the Kullback-Leibler Information Measure,” Vikram Krishnamurthy, et al., IEEE Trans. Sig. Proc., vol. 41, 1993, pp. 2557-2573.
“A Vector Taylor Series Approach for Environment-Independent Speech Recognition,” Pedro J. Moreno, ICASSP, vol. 1, 1996, pp. 733-736.
“Recursive Parameter Estimation Using Incomplete Data,” D.M. Titterington, J. J. Royal Stat. Soc., Vol. 46(B), 1984, pp. 257-267.
“The Aurora Experimental Framework for the Performance Evaluations of Speech Recognition Systems Under Noisy Conditions,” David Pearce, et al., Proc. ISCA IIRW ASR 2000, Sep. 2000.
“Efficient On-Line Acoustic Environment Estimation for FCDCN in a Continuous Speech Recognition System,” Jasha Droppo, et al., ICASSP, 2001.
“Robust Automatic Speech Recognition With Missing and Unreliable Acoustic Data,” Martin Cooke, Speech Communication, vol. 34, No. 3, pp. 267-285, Jun. 2001.
Ephraim, Yariv, “On the Application of Hidden Markov Models for Enhancing Noisy Speech,” IEEE ICASSP vol. 1, conf. 13, p. 533-536, 1988.
Moreno, Pedro J., “Multivariate-Gaussian-Based Cepstral Normalization for Robust Speech Recognition,” Proceedings of ICASSP, p. 137-140, 1995.
Neumeyer, L. and Weintraub, M. “Probabilistic Optimum Filtering For Robust Speech Recognition,” Acoustics, Speech and Signal Processing, ICASSP-94, p. 417-420, Apr. 1994.
“Speech Denoising and Dereverberation Using Probabilistic Models,” Hagai Attias, et al., Advances in NIPS, vol. 13, 2000 pp. 758-764.
“Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition,” Brendan J. Frey, et al., Neural Information Processing Systems Conference, 2001, pp. 1165-1121.
“Statistical-Model-Based Speech Enhancement Systems,” Proc. of IEEE, vol. 80, No. 10, Oct. 1992, p. 1526.
“HMM-Based Strategies for Enhancement of Speech Signals Embedded in Nonstationary Noise,” Hossein Sameti, IEEE Trans. Speech Audio Processing, vol. 6, No. 5, Sep. 1998, pp. 445-455.
“Model-based Compensation of the Additive Noise for Continuous Speech Recognition,” J.C. Segura, et al., Eurospeech 2001.
“Large-Vocabulary Speech Recognition Under Adverse Acoustic Environments,” Li Deng, et al., Proc. ICSLP, vol. 3, 2000, pp. 806-809.
“A New Method for Speech Denoising and Robust Speech Recognition Using Probabilistic Models for Clean Speech and for Noise,” Hagai Attias, et al., Proc. Eurospeech, 2001, pp. 1903-1906.
Anastasakos, T., et al., “A Compact Model for Speaker-Adaptive Training,” BBN Systems and Technologies, pp. 1137-1140, 1996.
Boll, S. F., “Suppression of Acoustic Noise in Speech Using Spectral Subtraction,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-27, No. 2, pp. 113-120 (Apr. 1979).
Lockwood, P. et al., “Experiments With a Nonlinear Spectral Subtractor (NSS) , Hidden Markov Models and the Projection, for Robust Speech Recognition in Cars,” Speech Communication 11, pp. 215-228 (1992).
Boll, S.F. , “A Spectral Subtraction Algorithm for Suppression of Acoustic Noise in Speech,” IEEE International Conference on Acoustics, Speech & Signal Processing, pp. 200-203 (Apr. 2-4, 1979).
Berouti, M. et al., “Enhancement of Speech Corrupted by Acoustic Noise,” IEEE International Conference on Acoustics, Speech & Signal Processing, pp. 208-211 (Apr. 2-4, 1979).
Acero, A., “Acoustical and Environmental Robustness in Automatic Speech Recognition, ”Department of Electrical and Computer Engineering, Carnegie Mellon University, pp. 1-141 (Sep. 13, 1990).
Moreno, P., “Speech Recognition in Noisy Environments,” Department of Electrical and Computer Engineering, Carnegie Mellon University, pp. 1-130 (Apr. 22, 1996).
Prosecution Pack for U.S. Appl. No. 09/688,764 consisting of All OA: Nov. 5, 2003; Jan. 24, 2005 Responses: Jan. 21, 2004; Jun. 18, 2004; Feb. 2, 2005.
Zhao, Yunxin, “Frequency-Domain Maximum Likelihood Estimation for Automatic Speech Recognition in Additive and Convolutive Noises,” IEEE Transactions on Speech and Audio Processing, vol. 8, No. 3, pp. 255-266, May 2000.
Copy of Official Communication from Application No. 01124142.9, filed Oct. 10, 2001.
Acero Alejandro
Deng Li
Huang Xuedong
Magee Theodore M.
Microsoft Corporation
Opsasnick Michael N
Westman Champlin & Kelly P.A.
LandOfFree
Method of noise reduction using correction and scaling... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method of noise reduction using correction and scaling..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of noise reduction using correction and scaling... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3849279