Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2000-10-18
2004-01-20
Abebe, Daniel (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S245000, C704S257000
Reexamination Certificate
active
06681206
ABSTRACT:
TECHNICAL FIELD
The invention relates to automated systems for communication recognition and understanding.
BACKGROUND OF THE INVENTION
Conventional methods for constructing spoken language systems involve collecting and annotating large speech corpora for a task. This speech is manually transcribed and each utterance is then semantically labeled. The resultant database is exploited to train stochastic language models for recognition and understanding. These models are further adapted for different dialog states. Examples of such methods are shown in U.S. Pat. Nos. 5,675,707, 5,860,063 and 6,044,337, and U.S. patent application Ser. Nos. 08/943,944, filed Oct. 3, 1997, and 09/217,635, filed Dec. 21, 1998, each of which is incorporated by reference herein in its entirety.
This transcription and labeling process is a major bottleneck in new application development and refinement of existing ones. For incremental training of a deployed natural spoken dialog system, current technology would potentially require transcribing millions of transactions. This process is both time-consuming and prohibitively expensive.
SUMMARY OF THE INVENTION
The invention concerns a method of generating morphemes for speech recognition and understanding. The method may include receiving training speech, selecting candidate sub-morphemes from the training speech, selecting salient sub-morphemes from the candidate sub-morphemes based on salience measurements, and clustering the salient sub-morphemes based on semantic and syntactic similarities into morphemes.
The morphemes may be acoustic and/or non-acoustic. The sub-morphemes may represent any sub-unit of communication including phones, phone-phrases, grammars, diphones, words, gestures, tablet strokes, body movements, mouse clicks, etc. The training speech may be verbal, non-verbal, a combination of verbal and non-verbal, or multimodal.
REFERENCES:
patent: 5675707 (1997-10-01), Gorin et al.
patent: 5794193 (1998-08-01), Gorin
patent: 5860063 (1999-01-01), Gorin et al.
patent: 6021384 (2000-02-01), Gorin et al.
patent: 6044337 (2000-03-01), Gorin et al.
patent: 6233553 (2001-05-01), Contolini et al.
patent: 6317707 (2001-11-01), Bangalore et al.
Gorin Allen Louis
Petrovska-Delacretaz Dijana
Riccardi Giuseppe
Wright Jeremy Huntley
Abebe Daniel
AT&T Corporation
LandOfFree
Method for generating morphemes does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for generating morphemes, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for generating morphemes will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3242255