Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
1999-11-08
2001-06-19
Korzuch, William R. (Department: 2641)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
Reexamination Certificate
active
06249760
ABSTRACT:
FIELD OF THE INVENTION
The present invention is related to the field of speech recognition systems and more particularly to a speech reference enrollment method.
BACKGROUND OF THE INVENTION
Both speech recognition and speaker verification application often use an enrollment process to obtain reference speech patterns for later use. Speech recognition systems that use an enrollment process are generally speaker dependent systems. Both speech recognition systems using an enrollment process and speaker verification systems will be referred herein as speech reference systems. The performance of speech reference systems is limited by the quality of the reference patterns obtained in the enrollment process. Prior art enrollment processes ask the user to speak the vocabulary word being enrolled and use the extracted features as the reference pattern for the vocabulary word. These systems suffer from unexpected background noise occurring while the user is uttering the vocabulary word during the enrollment process. This unexpected background noise is then incorporated into the reference pattern. Since the unexpected background noise does not occur every time the user utters the vocabulary word, it degrades the ability of the speech reference system's ability to match the reference pattern with a subsequent utterance.
Thus there exists a need for an enrollment process for speech reference systems that does not incorporate unexpected background noise in the reference patterns.
SUMMARY OF THE INVENTION
A speech reference enrollment method that overcomes these and other problems involves the following steps: (a) requesting a user speak a vocabulary word; (b) detecting a first utterance; (c) requesting the user speak the vocabulary word; (d) detecting a second utterance; (e) determining a first similarity between the first utterance and the second utterance; (f) when the first similarity is less than a predetermined similarity, requesting the user speak the vocabulary word; (g) detecting a third utterance; (h) determining a second similarity between the first utterance and the third utterance; and (i) when the second similarity is greater than or equal to the predetermined similarity, creating a reference.
REFERENCES:
patent: 4535473 (1985-08-01), Sakata
patent: 4630305 (1986-12-01), Borth et al.
patent: 4912766 (1990-03-01), Forse
patent: 4937870 (1990-06-01), Bossemeyer, Jr.
patent: 5742694 (1998-04-01), Eatwell
Ameritech Corporation
Halling Dale B.
Korzuch William R.
Storm Donald L.
LandOfFree
Apparatus for gain adjustment during speech reference... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Apparatus for gain adjustment during speech reference..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus for gain adjustment during speech reference... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2539972