Detecting speech recognition errors in an embedded speech...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S231000, C704S251000, C704S244000

Reexamination Certificate

active

06754627

ABSTRACT:

CROSS REFERENCE TO RELATED APPLICATIONS
(Not Applicable)
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
(Not Applicable)
BACKGROUND OF THE INVENTION
1. Technical Field
This invention relates to the field of embedded speech recognition systems and more particularly to detecting speech recognition errors in an embedded speech recognition system.
2. Description of the Related Art
Speech recognition is the process by which an acoustic signal received by microphone is converted to a set of text words by a computer. These recognized words may then be used in a variety of computer software applications for purposes such as document preparation, data entry, and command and control. Speech recognition systems programmed or trained to the diction and inflection of a single person can successfully recognize the vast majority of words spoken by that person.
In operation, speech recognition systems can model and classify acoustic signals to form acoustic models, which are representations of basic linguistic units referred to as phonemes. Upon receipt of the acoustic signal, the speech recognition system can analyze the acoustic signal, identify a series of acoustic models within the acoustic signal and derive a list of potential word candidates for the given series of acoustic models. Subsequently, the speech recognition system can contextually analyze the potential word candidates using a language model as a guide.
The task of the language model is to express restrictions imposed on the manner in which words can be combined to form sentences. The language model can express the likelihood of a word appearing immediately adjacent to another word or words. Language models used within speech recognition systems typically are statistical models. Examples of well-known language models suitable for use in speech recognition systems include uniform language models, finite state language models, grammar based language models, and m-gram language models.
Notably, the accuracy of a speech recognition system can improve as word combination statistics collected in a language model are refined during the operation of the speech recognition system. That is, the speech recognition system can observe speech dictation as it occurs and can modify the language model statistics in the language model as correct combinations of words are observed. In consequence, when a misrecognition occurs, it is important to update the language model in order to properly reflect an accurate combination of words as specified by the user. In order to update the language model, however, generally the user first must inform the speech recognition system that a misrecognition has occurred.
In an embedded computer system, typical personal computing peripherals such as a keyboard, mouse, display and graphical user interface (GUI) often do not exist. As such, the lack of a conventional mechanism for interacting with a user can inhibit effective user interaction with an embedded computer system. This problem can become exacerbated where a speech recognition system is an operational component of an embedded computer system. In particular, without an effective mechanism for notifying a speech recognition system when a misrecognition has occurred, the speech recognition system cannot appropriately update the corresponding speech recognition system language model so as to reduce future misrecognitions.
SUMMARY OF THE INVENTION
An embedded speech recognition system in accordance with the inventive arrangements can include an embedded computer system; a speech recognition system configured for operation in the embedded computer system; a speech-enabled application for processing text converted in the speech recognition system; and, misrecognition error logic for notifying the speech recognition system when a misrecognition error has occurred. The embedded speech recognition system can further include an activatable error notification button coupled to the embedded computer system, the button triggering the misrecognition error logic when activated. The embedded computer system in the embedded speech recognition system can include a central processing unit (CPU); memory; audio circuitry; and, an audio input device. An audio output device optionally can be included. In addition, the embedded speech recognition system can further include at least one speech recognition language model stored in the memory.
A method for processing a misrecognition error in an embedded speech recognition system during a speech recognition session can include speech-to-text converting audio input in the embedded speech recognition system based on an active language model, the speech-to-text conversion producing speech recognized text; presenting the speech recognized text through a user interface; detecting a user-initiated misrecognition error notification; and, responsive to detecting the error notification, providing the audio input and a reference to the active language model to a speech recognition system training process associated with the embedded speech recognition system.
Importantly, the detecting step can include receiving a hardware-generated notification caused by the activation of an error notification button. Alternatively, the detecting step can include receiving a software-generated notification caused by the receipt of a error notification speech command. An exemplary error notification speech command can include, “Recognition Error” or “Misrecognition”. Finally, the providing step can include storing the audio input; storing a reference to the active language model; and, providing the stored audio input and reference to the training process subsequent to the speech recognition session.


REFERENCES:
patent: 4866778 (1989-09-01), Baker
patent: 5606644 (1997-02-01), Chou et al.
patent: 5689617 (1997-11-01), Pallakoff et al.
patent: 6064961 (2000-05-01), Hanson
patent: 6138099 (2000-10-01), Lewis et al.
patent: 6185535 (2001-02-01), Hedin et al.
patent: 6219643 (2001-04-01), Cohen et al.
patent: 6243481 (2001-06-01), Tao
patent: 6587824 (2003-07-01), Everhart et al.
patent: 635820 (1997-02-01), None
Khan (Neufuz: The Integration of Fuzzy Logic and Neural Nets Offers Designers Tremendous Application Advantage—Today WESCON/'93 Conference Record, Sep. 1993).*
“IEEE 100” The Authoritative Dictionary of IEEE Standards Terms, 7th edition, IEEE Press, © 2000.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Detecting speech recognition errors in an embedded speech... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Detecting speech recognition errors in an embedded speech..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Detecting speech recognition errors in an embedded speech... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3321187

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.