Combining N-best lists from multiple speech recognizers

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Combining N-best lists from multiple speech recognizers Combining N-best lists from multiple speech recognizers

: 2001-06-13
: 2004-03-02
: McFadden, Susan (Department: 2655)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: Recognition

: C704S255000, C704S246000, C704S250000
: Reexamination Certificate
: active
: 06701293
: ABSTRACT:

BACKGROUND
1. Field
This disclosure relates to speech recognition systems, more particularly to methods to combine the N-best lists from multiple recognizers.
2. Background
Speech recognizers are those components used in speech recognition systems that perform the actual conversion from the incoming audio stream to text or commands. The recognizer uses algorithms to match what the user says to elements in a speech model. The recognizer then returns text corresponding to user's speech to the application utilizing the speech recognition. In one example, the algorithms are run on a digital signal processor. However, even with powerful processors and detailed speech models, errors still occur. Word recognition rates are generally better than 90%, but failures occur, especially over sequences of words.
Because of uncertainties in the recognition process, the speech recognizer may return several possible text results and allow the application that requested the recognition to select the most appropriate result based on knowledge it possesses regarding the user, the task, the context or other factors. Many speech recognizers support this concept of N-best recognition. The recognizer returns a list of elements that the user might have said, typically accompanied by a score of how confident the recognizer is of each potential match. This list will be referred to here as an N-best list. The application software then decides which entry in the N-best list to use.
Current speech recognition applications use only a single recognizer. However, many speech recognition applications may benefit from the use of several different recognizers. Different recognizers from different manufacturers perform differently even if targeted at the same market. This is due to the use of different algorithms to perform the speech recognition and different training data used to create speech models used by the recognizers. If multiple recognizers are used concurrently, several different N-best lists may be returned to the application. Recognition accuracy could be degraded if the N-best list selected is from a recognizer with poor performance in a particular situation.
Therefore, it would seem useful to have a process for selecting which recognizers should process an audio stream and one for combining N-best lists from different recognizers into one N-best list prior to the list being returned to the application.
SUMMARY
One aspect of the disclosure is a speech recognition system. The system includes a port for receiving an input audio stream and one or more recognizers operable to convert the input audio stream from speech to text or commands. The system also includes a combiner operable to combine lists of possible results produced by each recognizer into a combined list. Some subset of the combined list is then sent back to the application, allowing the application to select the desired conversion result.
Another aspect of the disclosure is a method to utilize multiple speech recognizers. An input audio stream is routed to the enabled recognizers. The method of selecting the enabled recognizers is discussed below. A combiner receives a list of possible results from each of the enabled recognizers and combines the lists into a combined list and then returns a subset of that list to the application.
Another aspect of the disclosure is a method of combining N-best lists from multiple speech recognizers. A combiner receives an N-best list from each enabled speech recognizer and combines the entries in each list into an initial N-best list. The N-best list is then potentially reduced in size and sorted according to at least one sorting criteria. A subset of entries in the resulting sorted N-best list is then returned to the application.

REFERENCES:
patent: 5651096 (1997-07-01), Pallakoff et al.
patent: 5983177 (1999-11-01), Wu et al.
patent: 6377922 (2002-04-01), Brown et al.

Affiliated with

Anderson Andrew V.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Bennett Steven M.

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Intel Corporation

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Marger Johnson & McCollom PC

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

McFadden Susan

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Combining N-best lists from multiple speech recognizers does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Combining N-best lists from multiple speech recognizers, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Combining N-best lists from multiple speech recognizers will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3252999

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure