Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1996-03-22
1997-09-16
MacDonald, Allen R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
G10L 506
Patent
active
056689268
ABSTRACT:
Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be convened is translated to a series of phonetic frames 401 of the same form as the phonetic representations 208 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.
REFERENCES:
patent: 3632887 (1972-01-01), Leipp et al.
patent: 3704345 (1972-11-01), Coker et al.
patent: 5041983 (1991-08-01), Nakahara et al.
patent: 5163111 (1992-11-01), Baji et al.
Weijters et al, "Speech Synthesis with Artificial Neural Networks", Int'l Conf on Acoustics, Speech & Signal Processing, Mar. 28-Apr. 1, 1993, pp. 1764-1769 vol. 1.
Scordilis et al, "Text Processing for Speech Synthesis Using Parallel Distributed Models", 1989 IEEE Proc, Apr. 9-12 1989, pp. 765-769 vol. 2.
Tuerk et al, "The Development of a Connectionist Multiple Voice Text-to-Speech System", Int'l Conf on Acoustics Speech & Signal Processing, May 14-17 1991 pp. 749-752 vol. 2.
"Speech Synthesis with Artificial Neural Networks"; Ton Weijters, Johan Thole 1993 IEEE International Conference on Neural Networks, San Francisco, CA Mar. 28-Apr. 1, vol. 3, pp. 1264-1269.
Corrigan Gerald Edward
Gerson Ira Alan
Karaali Orhan
MacDonald Allen R.
Motorola Inc.
Stockley Darleen J.
Wieland Susan
LandOfFree
Method and apparatus for converting text into audible signals us does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for converting text into audible signals us, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for converting text into audible signals us will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-224497