Automated voice synthesis from text having a restricted known in

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Details

704258, 704267, 704268, G10L 300

active

058901175

ABSTRACT:
Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4470150 (1984-09-01), Ostrowski
patent: 4685135 (1987-08-01), Lin et al.
patent: 4689817 (1987-08-01), Kroon
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4695962 (1987-09-01), Goudie
patent: 4783810 (1988-11-01), Kroon
patent: 4783811 (1988-11-01), Fisher et al.
patent: 4829580 (1989-05-01), Church
patent: 4831654 (1989-05-01), Dick
patent: 4896359 (1990-01-01), Yamamoto et al.
patent: 4907279 (1990-03-01), Higuchi et al.
patent: 4908867 (1990-03-01), Silverman
patent: 4912768 (1990-03-01), Benbassat
patent: 4964167 (1990-10-01), Kunizawa et al.
patent: 4979216 (1990-12-01), Maisheen et al.
patent: 5040218 (1991-08-01), Vitale et al.
patent: 5204905 (1993-04-01), Mitome
patent: 5212731 (1993-05-01), Zimmermann
patent: 5384893 (1995-01-01), Hutchins
patent: 5475796 (1995-12-01), Iwata
patent: 5617507 (1997-04-01), Lee et al.
patent: 5636325 (1997-06-01), Farrett
Taylor et al, "An interactive synthetic speech generation system," IEE Colloquium on `systems and applications of man-machine interaction using speech i/o`, p. 6/1-3, Mar. 1991.
Bachenko et al, "Prosodic phrasing for speech synthesis of written telecommunications by the deaf," IEEE Global telecommunications Conference. Globecom '91, p. 1391-1395 vol. 2, Dec. 1991.
Chen et al, "A first study of neural net based generation of prosodic and spectral information for Mandarin text-to-speech," ICASSP-92, p. 45-48 vol. 2, Mar. 1992.
Julia Hirschberg and Janet Pierrehumbert, "The Intonational Structuring of Discourse", Association for Computational Linguistics: 1986 (ACL-86) pp. 1-9.
S.J. Young, F. Fallside, "Synthesis by Rule of Prosodic Features in Word Concatenation Synthesis", Int. Journal Man-Machine Studies, (1980) V12, pp. 241-258.
A.W.F. Huggins, "Speech Timing and Intelligibility", Attention and Performance VII, Hillsdale, NJ: Erlbaum 1978, pp. 279-297.
S.J. Young and F. Fallside, "Speech Synthesis from Concept: A Method for Speech Output From Information Systems", J. Acoust. Soc. Am. 66(3) Sep. 1979 pp. 685-695.
B.G. Greene, J.S. Logan, D.B. Pisoni, "Perception of Synthetic Speech Produced Automatically by Rule: Intelligibility of Eight Text-to-Speech Systems", Behavior Research Methods, Instruments & Computers, V18, 1986, pp. 100-107.
B.G. Greene, L.M. Manous, D.B. Pisoni, "Perceptual Evaluation of DECtalk: A Final Report on Version 1.8*", Research on Speech Perception Progress Report No. 10, Bloomington, IN. Speech Research Laboratory, Indiana University (1984), pp. 77-127.
Kim E.A. Silverman, Doctoral Thesis, "The Structure and Processing of Fundamental Frequency Contours", University of Cambridge (UK) 1987.
J.C. Thomas and M.B. Rosson, "Human Factors and Synthetic Speech", Human Computer Interaction--Interact '84, North Holland Elsevier Science Publishers (1984) pp. 219-224.
Y. Sagisaka, "Speech Synthesis From Text", IEEE Communications Magazine, vol. 28, iss 1, Jan. 1990, pp. 35-41.
E. Fitzpatrick and J. Bachenko, "Parsing for Prosody: What a Text-to-Speech System Needs from Syntax", pp. 188-194, 27-31 Mar. 1989.
Moulines et al., "A Real-Time French Text-to-Speech System Generating High-Quality Synthetic Speech", ICASSP 90, pp. 309-312, vol. 1, 3-6 Apr. 1990.
Wilemse et al, "Context Free Card Parsing In A Text-To-Speech System", ICASSP 91, pp. 757-760, vol. 2, 14-17 May, 1991.
James Raymond Davis and Julia Hirschberg, "Assigning Intonational Features in Synthesized Spoken Directions", 26th Annual Meeting of Assoc. Computational Linguistics; 1988, pp. 1-9.
K. Silverman, S. Basson, S. Levas, "Evaluating Synthesizer Performance: Is Segmental Intelligibility Enough", International Conf. on Spoken Language Processing, 1990.
J. Allen, M.S. Hunnicutt, D. Klatt, "From Text to Speech: The MITalk System", Cambridge University Press, 1987.
T. Boogaart, K. Silverman, "Evaluating the Overall Comprehensibility of Speech Synthesizers", Proc. Int'l Conference on Spoken Language Processing, 1990.
K. Silverman, S. Basson, S. Levas, "On Evaluating Synthetic Speech: What Load Does It Place on a Listener's Cognitive Resources", Proc. 3rd Austral. Int'l Conf. Speech Science & Technology, 1990.
