Patent
1992-09-23
1995-01-24
MacDonald, Allen R.
395 267, G10L 900
Patent
active
053848934
ABSTRACT:
A system for synthesizing a speech signal from strings of words, which are themselves strings of characters, includes a memory in which predetermined syntax tags are stored in association with entered words and phonetic transcriptions are stored in association with the syntax tags. A parser accesses the memory and groups the syntax tags of the entered words into phrases according to a first set of predetermined grammatical rules relating the syntax tags to one another. The parser also verifies the conformance of sequences of the phrases to a second set of predetermined grammatical rules relating the phrases to one another. The system retrieves the phonetic transcriptions associated with the syntax tags that were grouped into phrases conforming to the second set of rules, and also translates predetermined strings of characters into words. The system generates strings of phonetic transcriptions and prosody markers corresponding to respective strings of the words, and adds markers for rhythm and stress to the strings, which are then converted into data arrays having prosody information on a diphone-by-diphone basis. Predetermined diphone waveforms are retrieved from memory that correspond to the entered words, and these retrieved waveforms are adjusted based on the prosody information in the arrays. The adjusted diphone waveforms, which may also be adjusted for coarticulation, are then concatenated to form the speech signal. Methods in a digital computer are also disclosed.
REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4214125 (1980-07-01), Mozer et al.
patent: 4314105 (1982-02-01), Mozer
patent: 4384170 (1983-05-01), Mozer et al.
patent: 4433434 (1984-02-01), Mozer
patent: 4435831 (1984-03-01), Mozer
patent: 4458110 (1984-07-01), Mozer
patent: 4624012 (1986-11-01), Lin et al.
patent: 4685135 (1987-08-01), Lin et al.
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4695962 (1987-09-01), Goudie
patent: 4797930 (1989-01-01), Goudie
patent: 4831654 (1989-05-01), Dick
patent: 4833718 (1989-05-01), Sprague
patent: 4852168 (1989-07-01), Sprague
patent: 4872202 (1989-10-01), Fette
patent: 4896359 (1990-01-01), Yamamoto et al.
patent: 4907279 (1990-03-01), Higuchi et al.
patent: 4912768 (1990-03-01), Benbassat
patent: 4964167 (1990-10-01), Kunizawa et al.
patent: 4975957 (1990-12-01), Ichikawa et al.
D. Klatt, "Software for a Cascade/Parallel Formant Synthesizer", J. Acoust. Soc. of Amer., vol. 67, pp. 971-994 (Mar. 1980).
D. Malah, "Time-Domain Algorithms for Harmonic Bandwidth Reduction and Time Scaling of Speech Signals", IEEE Trans. on Acoustic, Speech and Signal Processing, vol. ASSP-27, pp. 121-133 (Apr. 1979).
F. Lee, "Time Compression and Expansion of Speech by the Sampling Method", J. Audio Eng'g Soc., vol. 20, pp. 738-742 (Nov. 1972).
T. Sakai et al., "On-Line, Real-Time, Multiple-Speech Output System", Proc. Int'l Fed. for Info. Processing Cong. Booklet TA-4 Ljubljana, Yugoslavia (Aug. 1971) pp. 3-7.
T. Tremain, "The Government Standard Linear Predictive Coding Algorithm: LPC-10", Speech Technology, vol. 1, No. 2, pp. 40-49 (Apr. 1982).
Doerrler Michelle
Emerson & Stern Associates, Inc.
MacDonald Allen R.
LandOfFree
Method and apparatus for speech synthesis based on prosodic anal does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for speech synthesis based on prosodic anal, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for speech synthesis based on prosodic anal will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1473796