Methods for controlling the generation of speech from text repre

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Details

704/9, 704/266, G10L 5/02

Patent

active

058324356

ABSTRACT:
Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4470150 (1984-09-01), Ostrowski
patent: 4685135 (1987-08-01), Lin et al.
patent: 4689817 (1987-08-01), Kroon
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4695962 (1987-09-01), Goudie
patent: 4783810 (1988-11-01), Kroon
patent: 4783811 (1988-11-01), Fisher et al.
patent: 4829580 (1989-05-01), Church
patent: 4831654 (1989-05-01), Dick
patent: 4896359 (1990-01-01), Yamamoto et al.
patent: 4907279 (1990-03-01), Higuchi et al.
patent: 4908867 (1990-03-01), Silverman
patent: 4912768 (1990-03-01), Benbassat
patent: 4964167 (1990-10-01), Kunizawa et al.
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5040218 (1991-08-01), Vitale et al.
patent: 5204905 (1993-04-01), Mitome
patent: 5212731 (1993-05-01), Zimmermann
patent: 5384893 (1995-01-01), Hutchins
patent: 5475796 (1995-12-01), Iwata
patent: 5615300 (1997-03-01), Hara et al.
patent: 5617507 (1997-04-01), Lee et al.
patent: 5636325 (1997-06-01), Farrett
patent: 5673362 (1997-09-01), Matsumoto
Taylor et al, "An interactive synthetic speech generation system," IEE Colloquium on `systems and applications of man-machine interaction using speech i/o`, p. 6/1-3, Mar. 1991.
Bachenko et al, "Prosodic phrasing for speech synthesis of written telecommunications by the deaf," IEEE Global Telecommunications Conference. Globecom '91, pp. 1391-5 vol. 2, Dec. 1991.
Chen et al, "A first study of neural net based generation of prosodic and spectral information for Mandarin text-to-speech," ICASSP-92, pp. 45-8 vol. 2, Mar. 1992.
Bang et al, "A text-to-speech system for Spanish with a frequency domain based prosodic modification algorithm," ICASSP '93, pp. II-183--II-186, Apr. 1993.
Chen et al, "Word recognition based on the combination of a sequential neural network and the GPDM discriminative training algorithm," Neural Networks for Signal Processing. Proceedings of the 1991 IEEE Workshop, pp. 376-84, Oct. 1991.
Hwang et al, "Neural-network based F0 text-to-speech synthesizer for Mandarin," IEE Proceedings-Vision, Image, and Signal Processing, vol. 141, iss. 6, pp. 384-90, Dec. 1994.
Julia Hirschberg and Janet Pierrehumbert, "The Intonational Structuring of Discourse", Association of Computational Linguistics: 1986 (ACL-86) pp. 1-9.
S.J. Young, F. Fallside, "Synthesis by Rule of Prosodic Features in Word Concatenation Synthesis", Int. Journal Man-Machine Studies, (1980) v12, pp. 241-258.
A.W.F. Huggins, "Speech Timing and Intelligibility", Attention and Performance VII, Hillsdale, NJ: Erlbaum 1978, pp. 279-297.
S.J. Young and F. Fallside, "Speech Synthesis from Concept: A Method for Speech Output From Information Systems", J. Acoust. Soc. Am. 66(3), Sep. 1979, pp. 685-695.
B.G. Greene, J.S. Logan, D.B. Pisoni, "Perception of Synthetic Speech Produced Automatically by Rule: Intelligibility of Eight Text-to-Speech Systems", Behavior Research Methods, Instruments & Computers, v18, 1986, pp. 100-107.
B.G. Greene, L.M. Manous, D.B. Pisoni, "Perceptual Evaluation of DECtalk: A Final Report on Version 1.8*", Research on Speech Perception Progress Report No. 10, Bloomington, IN. Speech Research Laboratory, Indiana University (1984), pp. 77-127.
Kim E.A. Silverman, Doctoral Thesis, "The Structure and Processing of Fundamental Frequency Contours", University of Cambridge (UK) 1987.
J.C. Thomas and M.B. Rosson, "Human Factors Synthetic Speech", Human Computer Interaction--INTERACT '84, North Holland Elsevier Science Publishers (1984) pp. 219-224.
Y. Sagisaka, "Speech Synthesis From Text", IEEE Communications Magazine, vol. 28, iss 1, Jan. 1990, pp. 35-41.
E. Fitzpatrick and J. Bachenko, "Parsing for Prosody: What a Text-to-Speech System Needs from Syntax", pp. 188-194, 27-31 Mar. 1989.
Moulines et al., "A Real-Time French Text-To-Speech System Generating High-Quality Synthetic Speech", ICASSP 90, pp. 309-312, vol. 1, 3-6 Apr. 1990.
Wilemse et al, "Context Free Card Parsing In A Text-To-Speech System", ICASSP 91, pp. 757-760, vol. 2, 14-17 May, 1991.
James Raymond Davis and Julia Hirschberg, "Assigning Intonational Features in Synthesized Spoken Directions", 26th Annual Meeting of Assoc. Computational Linguistics; 1988, pp. 1-9.
K. Silverman, S. Basson, S. Levas, "Evaluating Synthesizer Performance: Is Segmental Intelligibility Enough", International Conf. on Spoken Language Processing, 1990.
J. Allen, M.S. Hunnicutt, D. Klatt, "From Text to Speech: The MITalk System", Cambridge University Press, 1987.
T. Boogaart, K. Silverman, "Evaluating the Overall Comprehensibility of Speech Synthesizers", Proc. Int'l Conference on Spoken Language Processing, 1990.
K. Silverman, S. Basson, S. Levas, "On Evaluating Synthetic Speech: What Load Does It Place on a Listener's Cognitive Resources", Proc. 3rd Austral. Int'l Conf. Speech Science & Technology, 1990.

Profile ID: LFUS-PAI-O-703574
