System for developing word-pronunciation pairs

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S257000, C704S251000, C704S260000

Reexamination Certificate

active

06363342

ABSTRACT:

BACKGROUND AND SUMMARY OF THE INVENTION
The present invention relates generally to speech recognition and speech synthesis systems. More particularly, the invention relates to developing word-pronunciation pairs.
Computer-implemented and automated speech technology today involves a confluence of many areas of expertise, ranging from linguistics and psychoacoustics, to digital signal processing and computer science. The traditionally separate problems of text-to-speech (TTS) synthesis and automatic speech recognition (ASR) actually present many opportunities to share technology. Traditionally, however, speech recognition and speech synthesis has been addressed as entirely separate disciplines, relying very little on the benefits that cross-pollination could have on both disciplines.
We have discovered techniques, described in this document for combining speech recognition and speech synthesis technologies to the mutual advantage of both disciplines in generating pronunciation dictionaries. Having a good pronunciation dictionary is key to both text-to-speech and automatic speech recognition applications. In the case of text-to-speech, the dictionary serves as the source of pronunciation for words entered by graphemic or spelled input. In automatic speech recognition applications, the dictionary serves as the lexicon of words that are known by the system. When training the speech recognition system, this lexicon identifies how each word is phonetically spelled, so that the speech models may be properly trained for each of the words
In both speech synthesis and speech recognition applications, the quality and performance of the application may be highly dependent on the accuracy of the pronunciation dictionary. Typically, it is expensive and time consuming to develop a good pronunciation dictionary, because the only way to obtain accurate data has heretofore been through use of professional linguists, preferably a single one to guarantee consistency. The linguist painstakingly steps through each word and provides its phonetic transcription.
Phonetic pronunciation dictionaries are available for most of the major languages, although these dictionaries typically have a limited word coverage and do not adequately handle proper names, unusual and compound nouns, or foreign words. Publicly available dictionaries likewise fall short when used to obtain pronunciations for a dialect different from the one for which the system was trained or intended.
Currently available dictionaries also rarely match all of the requirements of a given system. Some systems (such as text-to-speech systems) need high accuracy; whereas other systems (such as some automatic speech recognition systems) can tolerate lower accuracy, but may require multiple valid pronunciations for each word. In general, the diversity in system requirements compounds the problem. Because there is no “one size fits all” pronunciation dictionary, the construction of good, application-specific dictionaries remains expensive.
The present invention provides a system and method for developing word-pronunciation pairs for use in a pronunciation dictionary. The invention provides a tool, which builds upon a window environment to provide a user-friendly methodology for defining, manipulating and storing the phonetic representation of word-pronunciation pairs in a pronunciation dictionary. Unlike other phonetic transcription tools, the invention requires no specific linguistic or phonetic knowledge to produce the pronunciation lexicon. It utilizes various techniques to quickly provide the best phonetic representation of a given word along with different means for “fine tuning” this phonetic representation to achieve the desired pronunciation. Immediate feedback to validate word-pronunciation pairs is also provided by incorporating a text-to-speech synthesizer. Applications will quickly become apparent as developments expand in areas where exceptions to the rules of pronunciation are common, such as streets, cities, proper names and other specialized terminology.
For a more complete understanding of the invention, its objects and advantages refer to the following specification and to the accompanying drawings.


REFERENCES:
patent: 4831654 (1989-05-01), Dick
patent: 5212730 (1993-05-01), Wheatley et al.
patent: 5278943 (1994-01-01), Gasper et al.
patent: 5384893 (1995-01-01), Hutchins
patent: 5432948 (1995-07-01), Davis et al.
patent: 5682501 (1997-10-01), Sharman
patent: 5751906 (1998-05-01), Silverman
patent: 5794197 (1998-08-01), Alleva et al.
patent: 5832434 (1998-11-01), Meredith
patent: 5835892 (1998-11-01), Kanno
patent: 5933804 (1999-08-01), Huang et al.
patent: 6016471 (2000-01-01), Kuhn et al.
patent: 6029132 (2000-02-01), Kuhn et al.
patent: 6078885 (2000-06-01), Beutnagel
patent: 6092044 (2000-07-01), Baker et al.
patent: 6163769 (2000-12-01), Acero et al.
patent: 6230131 (2001-05-01), Kuhn et al.
patent: 6233553 (2001-05-01), Contolini et al.
patent: 0 562 138 (1993-09-01), None

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System for developing word-pronunciation pairs does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System for developing word-pronunciation pairs, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System for developing word-pronunciation pairs will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2866833

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.