Speech recognition enrollment for non-readers and...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Speech recognition enrollment for non-readers and... Speech recognition enrollment for non-readers and...

: 1999-02-10
: 2001-11-27
: Korzuch, William (Department: 2641)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: Recognition

: C704S275000, C379S088010
: Reexamination Certificate
: active
: 06324507
: ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates generally to the field of speech recognition systems, and in particular, to speech recognition enrollment for non-readers and displayless devices.
2. Description of Related Art
Users of speech recognition programs need to enroll, that is provide a sample for processing by the recognition system, in order to utilize the speech recognition system with maximum accuracy. When a user can read aloud fluently, it is easy to collect such a sample. When the user cannot read fluently for any reason, or when the speech system does not provide for a display device, collecting such a sample has thus far not been practical. Speech recognition systems can be implemented in connection with telephone and centralized dictation systems, which need not have display monitors as part of the equipment.
Recent years have brought significant improvements to speech recognition software. Speech recognition software, also referred to as a speech recognition engine, constructs text from the acoustic signal of a user's speech, either for purposes of dictation or command and control. Current systems sometimes allow users to speak to the system using a speaker-independent model to allow users to begin working with the software as quickly as possible. However, recognition accuracy is best when a user enrolls with the system.
During normal enrollment, the system presents text to the user, and records the user's speech while the user reads the text. This approach works well provided that the user can read fluently. When the user is not fluent in the language for which the user is enrolling, this approach will not work.
There are many reasons why a user might be a less than fluent. The following list is exemplary: the user can be a child who is just beginning to read; the user can be a child or adult having one or more learning disabilities that make reading unfamiliar material difficult; the user can be a user who speaks fluently, but has trouble reading fluently; the user can be enrolling in a system designed to teach the user a second language; and, the user can be enrolling in a system using a device that has no display, so there is nothing to read.
There is a long-felt need to provide speech recognition enrollment for non-readers and for speech systems without display devices.
SUMMARY OF THE INVENTION
An enrollment system must have certain properties in addition to those in systems for fluent readers in order to support users who are non-readers and users without access to display devices. In accordance with the inventive arrangements, the most important additional property is an ability to read the text to the user before expecting the user to read the text. This can be accomplished by using text-to-speech (TTS) tuned to ensure that the audible output faithfully produces the words with the correct pronunciation for the text, or by using recorded audio. Given adequate system resources, recorded audio is presently preferred as sounding more natural, but in systems with limited resources, for example handheld devices in a client-server system, TTS can be a better choice.
Thus, the long-felt need of the prior art is satisfied by providing the enrollment text to the user via an audio channel, with adjustments to the standard user interface to provide for an easy-to-understand sequence of events.
A method for enrolling a user in a speech recognition system without requiring reading, in accordance with the inventive arrangements, comprises the steps of: generating an audio user interface having an audible output and an audio input; audibly playing a text phrase; audibly prompting the user to speak the played text phrase; repeating the steps of audibly playing the text phrase and audibly prompting the user to speak, for a plurality of further text phrases; and, processing enrollment of the user based on the audibly prompted and subsequently spoken text phrases.
The method can further comprise the step of audibly playing a further one of the plurality of further text phrases only if the spoken phrase was received.
The method can further comprise the step of repeating the steps of audibly playing the text phrase and audibly prompting the user to speak for the most recently played text phrase if the spoken text phrase was not received.
The method can further comprise the step of audibly prompting the user, prior to the audibly playing step, not to speak while the text phrase is played.
The method can further comprise the step of generating audible user-progress notifications during the course of the enrollment.
The method can further comprise the step of audibly prompting the user in a first voice and playing said text phrases in a second voice.
The method can comprise the step of audibly playing at least some of the text phrases from recorded audio, audibly playing at least some of the text phrases with a text-to-speech engine, or both. Similarly, the user can be audibly prompted from recorded audio, with a text-to-speech engine, or both.
The method can further comprise the steps of: generating a graphical user interface concurrently with the step of generating the audio user interface; and, displaying text corresponding to the text phrases and to the audible prompts.
The method can further comprise the steps of: displaying a plurality of icons for user activation; and, selectively distinguishing different ones of the plurality of icons at different times by at least one of: color; shape; and, animation.
A computer apparatus programmed with a set of instructions stored in a fixed medium, for enrolling a user in a speech recognition system without requiring reading, in accordance with the inventive arrangements, comprises: means for generating an audio user interface having an audible output and an audio input; means for audibly playing a text phrase; and, means for audibly prompting the user to speak the played text phrase.
The apparatus can further comprise means for generating audible user-progress notifications during the course of the enrollment.
The means for audibly playing the text phrases can comprise means for playing back prerecorded audio, a text-to-speech engine, or both.
The apparatus can further comprise: means for generating a graphical user interface concurrently with the audio user interface; and, means for displaying text corresponding to the text phrases and to the audible prompts.
The apparatus can also further comprise: means for displaying a plurality of icons for user activation; and, means for selectively distinguishing different ones of the plurality of icons at different times by at least one of: color; shape; and, animation.

REFERENCES:
patent: 5502759 (1996-03-01), Cheng et al.
patent: 5569038 (1996-10-01), Tubman et al.
patent: 5592583 (1997-01-01), Sakurai
patent: 5659597 (1997-08-01), Bareis et al.
patent: 5717738 (1998-02-01), Gammel
patent: 5850629 (1998-12-01), Holm et al.
patent: 5950167 (1999-09-01), Yaker
patent: 6017219 (2000-01-01), Adams, Jr. et al.
patent: 6075534 (2000-06-01), VanBuskirk et al.
patent: 6122614 (2000-09-01), Kahn et al.
patent: 0867857A3 (1998-09-01), None
patent: WO98/45834 (1998-10-01), None
“Example Enrollment Text Playback for an Automatic Speech Recognizer” IBM Technical Disclosure Bulletin, US, IBM Corp., New York. vol. 36, No. 3, Mar. 1, 1993, p. 413 XP000354828.

Affiliated with

Buskirk Ron Van

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Lewis James R.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Ortega Kerry A.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Wang Huifang

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Akerman & Senterfitt

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

International Business Machines Corp.

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Korzuch William

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Lerner Martin

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Speech recognition enrollment for non-readers and... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition enrollment for non-readers and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition enrollment for non-readers and... will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-2615413

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure