Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
1999-10-12
2002-12-31
{haeck over (S)}mits, T{overscore (a)}livaldis Ivars (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S256000, C704S254000
Reexamination Certificate
active
06502072
ABSTRACT:
BACKGROUND OF THE INVENTION
The present invention relates to speech recognition. In particular, the present invention relates to noise rejection in speech recognition.
In speech recognition systems, an input speech signal is converted into words that represent the verbal content of the speech signal. This conversion is complicated by many factors including interfering sounds, which are generically referred to as noise. Noise includes such things as the sounds made when the speaker clears their throat or smacks their lips. It also includes external sounds such as the sound of footsteps, the sound of someone knocking at a door, and the sound of a phone ringing.
Since most speech recognition systems work by matching sounds to the basic acoustic units of speech, for example senones or phonemes, many speech recognition systems will identify noise as one or more words. For instance, if a user types on a keyboard during speech recognition, the sound of the typing may be interpreted as the word “its”.
To avoid such false acceptance, some speech recognition systems add models of noise to the acoustic models used for speech recognition. These models rely on a noise entry found in a lexicon for the speech recognizer. For example, a model would be created for the sound associated with knocking on a door. Because the model relies on an entry in the lexicon, noises that are not in the lexicon cannot be identified as noise by these models and are usually identified as a word. Since there is a wide variety of noises, it is impossible to include all noises in the lexicon. As such, there are a large number of noises that are improperly recognized as words in prior art speech recognition systems.
SUMMARY OF THE INVENTION
A method and apparatus is provided for two-tier noise rejection in speech recognition. The method and apparatus convert an analog speech signal into a digital signal and extract features from the digital signal. Hypothesis speech words and hypothesis noise words are identified from extracted features in a first tier of noise rejection by modeling common noises as words in a lexicon. The features associated with the hypothesis speech words are examined in a second tier of noise rejection to determine if the features are more likely to represent noise than speech. The hypothesis speech words are replaced by a noise marker if the features are more likely to represent noise than speech.
REFERENCES:
patent: 5797123 (1998-08-01), Chou et al.
patent: 2001/0018654 (2001-08-01), Hon et al.
Richard C. Rose and Douglas B. Paul, “A Hidden Markov Model Based Keyword Recognition System,” Proc. IEEE ICASSP 90, vol. 1, p. 129-132, Apr. 1990.*
Jay G. Wilpon, Lawrence R. Rabiner, Chin-Hui Lee, and E. R. Goldman, “Automatic Recognition of Keywords in Unconstrained Speech Using Hidden Markov Models,” IEEE Trans. ASSP, vol. 38, No. 11, p. 1870-1878, Nov. 1990.*
Richard C. Rose and E. Lleida, “Speech Recognition Using Automatically Derived Baseforms,” Proc. IEEE ICASSP 97, Vojl. 2, p. 1271-1274, Apr. 1997.*
Rafid A. Sukkar and Jay G. Wilpon, “A Two Pass Classifier for Utterance Rejection in Keyword Spotting,” Proc. IEEE ICASSP 93, vol. 2, p. 451-545, Apr. 1993.
Huang Xuedong
Jiang Li
Magee Theodore M.
Microsoft Corporation
Westman Champlin & Kelly P.A.
{haeck over (S)}mits T{overscore (a)}livaldis Ivars
LandOfFree
Two-tier noise rejection in speech recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Two-tier noise rejection in speech recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Two-tier noise rejection in speech recognition will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2994152