Patent
1998-04-28
2000-09-12
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704/235, 704/249, 704/260, G10L 13/06, G10L 13/08, G10L 15/26
active
061190861
ABSTRACT:
A speech coding system, responsive to an input speech signal provided by a system user, comprises: a speech coding portion including a speech recognition system responsive to the input speech signal and having a word vocabulary associated therewith, the speech recognition system recognizing the input speech signal in accordance with the vocabulary and generating phonetic tokens, such as at least one sequence of lefemes, representative of the input speech signal; a channel, responsive to the at least one sequence of lefemes, for transmitting and/or storing the at least one sequence of lefemes; and a speech synthesizing portion, responsive to the transmitted/stored sequence of lefemes, for generating a synthesized speech signal which is representative of the input speech signal provided by the system user using the at least one sequence of lefemes. The speech recognition system preferably generates acoustic parameters from the input speech signal which include voice characteristics of the system user. The speech coding system also preferably comprises a labeler which processes the input speech signal including words uttered by the system user which are not in the word vocabulary associated with the speech recognition system, the labeler generating phonetic tokens, such as at least one sequence of lefemes, optimally representative of the input speech signal. The sequence of lefemes from the labeler and the speech recognition portion are compared, for each speech segment, and the sequence most similar to the input speech is selected for transmission/storage. The speech synthesizing portion of the system preferably performs speech synthesis using pre-enrolled phonetic sub-units or tokens.
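The per-segment selection step described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Candidate` type, the `score` field (a stand-in for whatever similarity measure the system uses, higher meaning closer to the input speech), the tie-breaking rule, and the token spellings. The abstract does not specify any of these.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Candidate:
    """One hypothesized lefeme sequence for a single speech segment."""
    lefemes: List[str]  # phonetic tokens (illustrative spellings only)
    score: float        # hypothetical similarity to the input speech; higher is closer


def select_per_segment(
    recognizer: Sequence[Candidate],
    labeler: Sequence[Candidate],
) -> List[str]:
    """For each segment, keep the candidate closer to the input speech.

    Ties go to the recognizer output; this is an arbitrary choice made
    for the sketch, not something stated in the abstract.
    """
    if len(recognizer) != len(labeler):
        raise ValueError("both sources must cover the same segments")
    selected: List[str] = []
    for rec, lab in zip(recognizer, labeler):
        best = rec if rec.score >= lab.score else lab
        selected.extend(best.lefemes)
    return selected


if __name__ == "__main__":
    # Segment 1: the in-vocabulary word is recognized well.
    # Segment 2: an out-of-vocabulary word, where the labeler fits better.
    rec = [Candidate(["HH", "EH"], 0.9), Candidate(["L", "OW"], 0.2)]
    lab = [Candidate(["HH", "AH"], 0.5), Candidate(["L", "AO"], 0.7)]
    print(select_per_segment(rec, lab))
```

The selected sequence is what would be handed to the channel for transmission or storage; the synthesizing portion would then consume it on the other side.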
REFERENCES:
patent: 4424415 (1984-01-01), Lin
patent: 4473904 (1984-09-01), Suehiro et al.
patent: 4661915 (1987-04-01), Ott
patent: 4707858 (1987-11-01), Fette
patent: 5305421 (1994-04-01), Li
patent: 5524051 (1996-06-01), Ryan
patent: 5696879 (1997-12-01), Cline et al.
patent: 5832425 (1998-11-01), Mead
D. A. Reynolds and L. P. Heck, "Integration of Speaker and Speech Recognition Systems," Proc. IEEE ICASSP 91, pp. 869-872, Apr. 1991.
Ittycheriah Abraham
Maes Stephane H.
Nahamoo David
Hudspeth David R.
International Business Machines Corporation
Smits Talivaldis Ivars