System and method for correction of speech recognition mode...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S270000

Reexamination Certificate

active

06581033

ABSTRACT:

TECHNICAL FIELD
This invention relates generally to the field of computer systems and, more particularly to correcting a speech recognition mode error in a computer software program when the incorrect mode has been previously selected and speech input has been incorrectly input into the program.
BACKGROUND OF THE INVENTION
Since the advent of the personal computer, human interaction with the computer has been primarily through the keyboard. Typically, when a user wants to input information or to enter a command into a computer, the information or the command is typed on a keyboard attached to the computer. Other input devices have supplemented the keyboard as an input device, including the mouse, touch-screen displays, the integrated pointer device, and scanners. Use of these other input devices have decreased the amount of user time spent in entering data or commands into the computer.
Computer-based voice recognition and speech recognition systems have also been used for data or command input into personal computers. Voice recognition and speech recognition systems convert human speech into a format that can understood by the computer. When a computer is equipped with a voice recognition or speech recognition system, data and command input can be performed by merely speaking the data or command to the computer. The speed at which the user can speak is typically faster than conventional data or command entry. Therefore, the inherent speed in disseminating data or commands through human speech is a sought after advantage of incorporating voice recognition and speech recognition systems into personal computers.
Throughout the remainder of this disclosure, the terms “voice recognition” and “speech recognition” will be used synonymously. In some instances, a distinction is made between voice recognition and speech recognition. However, both voice recognition and speech recognition systems suffer from the same problems described herein, and the same solutions have been applied to both recognition technologies to resolve the shortcomings of the prior art.
The increased efficiency of users operating personal computers equipped with speech recognition systems has encouraged the use of such systems in the workplace. Many workers in a variety of industries now utilize speech recognition systems for numerous applications. For example, computer software programs utilizing voice recognition and speech recognition technologies have been created by DRAGON, IBM, and LERNOUT & HAUSPIE. When a user reads a document aloud or dictates to a speech recognition program, the program can enter the user's spoken words directly into a word processing program operating on a personal computer.
Generally, computer-based and speech recognition programs convert human speech into a series of digitized frequencies. These frequencies are matched against a previously stored set of words, or phonemes. When the computer determines correct matches for the series of frequencies, computer recognition of that portion of human speech is accomplished. The frequency matches are compiled until sufficient information is collected for the computer to react. The computer can then react to certain spoken words by storing the human speech in a memory device, transcribing the human speech into a document for a word processing program, or executing a command in a program module, such as an application program.
However, speech recognition systems are not 100% reliable. Even with hardware and software modifications, the most proficient speech recognition systems can attain approximately 97-99% reliability. Internal and external factors can affect the reliability of speech recognition systems. Factors dependent upon the recognition technology itself include the finite set of words or phonemes and the vocabulary of words to compare the speaker's input to. Environmental factors such as regional accents, external noise, and the microphone can degrade the quality of the input, thus affecting the frequency of the user's words and introducing potential error into the word or phoneme matching.
A speech recognition software program can be used to input commands or text into other application programs. For example, Kurzweil's “VOICEPRO” speech recognition software can be used to input text or commands into a document created by a word processing application program, such as MICROSOFT WORD. When a user chooses to use the speech recognition program to enter a command, the user manually selects the command mode in the speech recognition program. The user then speaks the command, such as “delete”. The speech recognition program processes the command, and sends the “delete” command to the word processing program for execution of the command. Most mode selection is done automatically, and the errors come from the machine getting the mode wrong rather than user error. The net effect is the same, though. If the user chooses to use the speech recognition program to enter text into a document, the user manually selects the dictation mode in the speech recognition program. The user then begins to speak the text to be input, such as “where do you want to go today”. The speech recognition program processes the speech, and sends the processed speech to the word processing program to be input into the document. The user selection of a mode is necessary for the speech recognition software to correctly process the user's speech input. Manual selection of the speech recognition mode before the user speaks is cumbersome and time consuming.
Occasionally, the user forgets to change the mode of the speech recognition program before speaking. For example, if the speech recognition program is in the command mode and the user says “copy machines make copies not coffee”, the speech recognition program will process the speech input “copy machines make copies not coffee” as a command. The speech input “copy” will be executed by the application program, but the remaining speech may not be understood as a command, and the application program will not process the speech.
On other occasions, the speech recognition program will be in the dictation mode and the user will want the word processor to execute a command. If the user forgets to change the mode and says “copy”, the speech recognition program will process the speech as dictation and the speech input will be entered as text into the application program.
Various solutions to the mode error problem have been attempted. The typical correction procedure involves the circumstance described above, when the user forgets to change the mode before speaking, resulting in a mode error. Sometimes, the mode error is compounded by the circumstance where the user does not realize he is in the wrong mode and the speech input is processed in the incorrect mode from the time the initial mode error was made. If the speech input has been incorrectly input as dictation, then the user can manually delete the dictation that has been input into the application program as text. The user continues the correction procedure by manually selecting the command mode before speaking again. If the speech input has been incorrectly input as a command, then the user can manually “undo” the executed command in the application program. The user continues the correction procedure by manually selecting the dictation mode before speaking again. The manual selection of the correct speech recognition mode and the manual correction of the “undo” or “delete” commands can be cumbersome and time consuming.
Thus, there is a need in the art for a method that reduces user time in correcting speech recognition mode errors.
There is a further need in the art for a method that reduces the number of keystrokes or commands in correcting speech recognition mode errors.
SUMMARY OF THE INVENTION
The present invention meets the needs described above in a speech engine correction module for correcting speech recognition mode errors. The speech engine correction module can reduce user time in correcting speech recognition mode errors. Furthermore, t

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for correction of speech recognition mode... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for correction of speech recognition mode..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for correction of speech recognition mode... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3095754

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.