Compound words in speech recognition systems

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S255000

Reexamination Certificate

active

06349282

ABSTRACT:

TECHNICAL FIELD
The present invention relates generally to speech recognition systems, and in particular, to the handling of compound words in the recognition results of such systems.
BACKGROUND ART
Computer recognition of speech is field of great complexity. Speech recognition poses difficult problems in many areas, and while never easy, individual languages have different problems which affect overall recognition successfulness. For several years, speech recognition systems were “isolated word” and required a user to pause between words. With increased computer power available, and ever more sophisticated recognition techniques, commercially available speech recognition systems are now “large vocabulary continuous” in which no pausing is required between words. In fact, such systems are even more accurate when a user does not pause between words, but speaks in multiple word phrases. However, with the advent of large vocabulary continuous speech recognition systems, new problems have emerged which did not exist or were not equally significant in isolated word recognition systems.
One such problem is how to deal with compound words—that is, words formed by concatenating component word parts. Some languages, such as German and Dutch, have a relatively high percentage of compound words. As an example, the Dutch word “rentevoet” is a compound formed of constituent component parts “rente” and “voet” Such compound words may form a significant fraction of all the “out of vocabulary” (OOV) words encountered by a recognition system. However, attempting to include such compound words in the system recognition vocabulary greatly increases the size of the recognition vocabulary.
SUMMARY OF THE INVENTION
A preferred embodiment of the present invention provides a postprocessor of a speech recognition system for generating compound words from a recognition result having a sequence of recognized words representative of an input utterance, the sequence including compound word components. The postprocessor has a compound lexicon and a compounder. The compound lexicon contains a plurality of compound words composed of compound word components and connecting links. The compounder replaces, in the sequence of recognized words, adjacent words that have corresponding linked components in the lexicon with a compound word in the compound lexicon composed of the adjacent words.
In a further embodiment, the replacement with the compound word may include an adjustment of the components for agreement for at least one of number, person, gender, and tense, or the addition of component-linking morphemes. The components may have a length greater than or equal to a selected minimum component length. The compound word entries in the compound lexicon may also include an ambiguity indicator field having a value indicating whether the components for a given compound word occur more frequently in compounded or uncompounded form.
In another further embodiment, the compounder may produce an output representing a best recognition hypothesis and at least one alternative recognition hypothesis, such that when the compounder performs a given compound word replacement, one hypothesis is generated that contains the given compound word, and one hypothesis is generated that contains the uncompounded components of the given compound word. In such an embodiment, when the compounder performs a compound word replacement, the best recognition hypothesis either may contain the given compound word, or alternatively, the best recognition hypothesis may contain the uncompounded components of the given compound word.
In an embodiment, the compound lexicon may contain a given compound word only when the components of the given compound word are more likely to occur together in a compound word than to occur as separate words. The compound lexicon also may contain a selected number of most frequently occurring compound words present in a text corpus.
A related embodiment includes an automatic speech recognition system having the postprocessor of one of the above embodiments. The automatic speech recognition system may be a large-vocabulary continuous speech recognition system.
Another preferred embodiment includes a method of a speech recognition system for postprocessing a recognition result having a sequence of recognized words representative of an input utterance so as to generate compound words, the sequence including compound word components. The method includes providing a compound lexicon that contains a plurality of compound words composed of compound word components and connecting links, and replacing, in the sequence of recognized words, adjacent words that have corresponding linked components in the lexicon with a compound word in the compound lexicon composed of the adjacent words.
In a further related embodiment, replacing with the compound word may include adjusting the components for agreement for at least one of number, person, gender, and tense, or adding component-linking morphemes. The components may have a length greater than or equal to a selected minimum component length The compound word entries in the compound lexicon may also include an ambiguity indicator field having a value indicating whether the components for a given compound word occur more frequently in compounded or uncompounded form.
In a further related embodiment, replacing with a compound word may include producing an output representing a best recognition hypothesis and at least one alternative recognition hypothesis, such that when replacing a given compound word, one hypothesis is generated that contains the given compound word, and one hypothesis is generated that contains the uncompounded components of the given compound word. In such an embodiment, when replacing a given compound word, the best recognition hypothesis may contain the given compound word, or alternatively, the best recognition hypothesis may contain the uncompounded components of the given compound word.
In an embodiment, the compound lexicon may contain a given compound word only when the components of the given compound word are more likely to occur together in a compound word than to occur as separate words. Or, the compound lexicon may contain a selected number of most frequently occurring compound words present in a text corpus.
An embodiment also includes an automatic speech recognition system using the method of one of the above embodiments. In such an embodiment, the system may be a large-vocabulary continuous speech recognition system.
Another preferred embodiment includes an automatic speech recognition system having a recognition engine, a compound lexicon and a compounder. The recognition engine generates a recognition result having a sequence of recognized words representative of an input utterance, the sequence including compound word components. The engine uses a recognition vocabulary of words and a language model which, for a given position in the sequence of recognized words and for selected words in the recognition vocabulary, associates an occurrence probability of such word occurring at such position. The compound lexicon contains a plurality of compound words composed of compound word components and connecting links. The compounder replaces, in the sequence of recognized words, adjacent words that have corresponding linked components in the lexicon with a compound word in the compound lexicon composed of the adjacent words. In addition, the language model occurrence probabilities prevent a given component in the lexicon from occurring in the recognition result unless the given component is adjacent to at least one other component such that, for the adjacent components, the lexicon contains linked entries corresponding to a compound word in the lexicon.
In a further related embodiment, the replacement with the compound word may include an adjustment of the components for agreement for at least one of number, person, gender, and tense, or the addition of component-linking morphemes. The compound components may have a length greater than or equal to a selected

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Compound words in speech recognition systems does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Compound words in speech recognition systems, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Compound words in speech recognition systems will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2964433

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.