Front-end architecture for a multi-lingual text-to-speech...

Data processing: speech signal processing – linguistics – language – Linguistics – Translation machine

Reexamination Certificate

Details

C704S002000, C704S003000, C704S260000, C704S270000, C704S277000, C715S252000

active

07496498

ABSTRACT:
A text processing system for processing multi-lingual text for a speech synthesizer includes a first language dependent module for performing at least one of text and prosody analysis on a portion of input text comprising a first language. A second language dependent module performs at least one of text and prosody analysis on a second portion of input text comprising a second language. A third module is adapted to receive outputs from the first and second language dependent modules and performs prosodic and phonetic context abstraction over the outputs based on the multi-lingual text.
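
The three-module arrangement described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. The names `Unit`, `Context`, `split_by_language`, `analyze_english`, `analyze_chinese`, `abstract_context` and `front_end` are hypothetical, the script-based language split is an assumption the abstract does not make, and the "analysis" in each language module is a toy stand-in for real text and prosody analysis.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Unit:
    """One analyzed token emitted by a language-dependent module."""
    text: str
    lang: str           # tag of the module that produced the unit
    phrase_final: bool  # toy prosody annotation: last unit of its segment


@dataclass
class Context:
    """Language-independent context record built by the third module."""
    unit: Unit
    index: int
    prev_lang: str            # "" at the start of the utterance
    next_lang: str            # "" at the end of the utterance
    at_language_switch: bool  # True if a neighbor is in another language


def is_cjk(ch: str) -> bool:
    # Assumption: only the CJK Unified Ideographs block counts as Chinese.
    return "\u4e00" <= ch <= "\u9fff"


def split_by_language(text: str) -> List[Tuple[str, str]]:
    """Split mixed text into (lang, segment) runs by script (hypothetical)."""
    runs: List[List[str]] = []
    for ch in text:
        if ch.isspace():
            if runs:
                runs[-1][1] += ch  # keep spacing inside the current run
            continue
        lang = "zh" if is_cjk(ch) else "en"
        if runs and runs[-1][0] == lang:
            runs[-1][1] += ch
        else:
            runs.append([lang, ch])
    return [(lang, seg.strip()) for lang, seg in runs]


def analyze_english(segment: str) -> List[Unit]:
    """First language-dependent module: whitespace-delimited words."""
    words = segment.split()
    return [Unit(w, "en", i == len(words) - 1) for i, w in enumerate(words)]


def analyze_chinese(segment: str) -> List[Unit]:
    """Second language-dependent module: one unit per character."""
    chars = [c for c in segment if not c.isspace()]
    return [Unit(c, "zh", i == len(chars) - 1) for i, c in enumerate(chars)]


def abstract_context(units: List[Unit]) -> List[Context]:
    """Third module: derive context over the merged, multi-lingual output."""
    out: List[Context] = []
    for i, u in enumerate(units):
        prev_lang = units[i - 1].lang if i > 0 else ""
        next_lang = units[i + 1].lang if i + 1 < len(units) else ""
        switch = prev_lang not in ("", u.lang) or next_lang not in ("", u.lang)
        out.append(Context(u, i, prev_lang, next_lang, switch))
    return out


MODULES: Dict[str, Callable[[str], List[Unit]]] = {
    "en": analyze_english,
    "zh": analyze_chinese,
}


def front_end(text: str) -> List[Context]:
    """Route each segment to its language module, then abstract context."""
    units: List[Unit] = []
    for lang, segment in split_by_language(text):
        units.extend(MODULES[lang](segment))
    return abstract_context(units)


result = front_end("I love 北京 very much")
```

In this sketch only the third step sees units from both languages at once, which is why the language-switch flag is computed there rather than inside either language module.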

REFERENCES:
patent: 4718094 (1988-01-01), Bahl
patent: 5146405 (1992-09-01), Church
patent: 5384893 (1995-01-01), Hutchins
patent: 5440481 (1995-08-01), Kostoff
patent: 5592585 (1997-01-01), Van Coile et al.
patent: 5727120 (1998-03-01), Van Coile et al.
patent: 5732395 (1998-03-01), Alexander Silverman
patent: 5839105 (1998-11-01), Ostendorf et al.
patent: 5857169 (1999-01-01), Seide
patent: 5890117 (1999-03-01), Silverman
patent: 5905972 (1999-05-01), Huang et al.
patent: 5912989 (1999-06-01), Watanabe
patent: 5933806 (1999-08-01), Beyerlein
patent: 5937422 (1999-08-01), Nelson
patent: 6064960 (2000-05-01), Bellegarda et al.
patent: 6076060 (2000-06-01), Lin et al.
patent: 6101470 (2000-08-01), Eide et al.
patent: 6141642 (2000-10-01), Oh
patent: 6151576 (2000-11-01), Warnock et al.
patent: 6172675 (2001-01-01), Ahmad
patent: 6185533 (2001-02-01), Holm et al.
patent: 6230131 (2001-05-01), Kuhn et al.
patent: 6401060 (2002-06-01), Critchlow et al.
patent: 6499014 (2002-12-01), Chihara
patent: 6505158 (2003-01-01), Conkie
patent: 6665641 (2003-12-01), Coorman et al.
patent: 6708152 (2004-03-01), Kivimaki
patent: 6751592 (2004-06-01), Shiga
patent: 6829578 (2004-12-01), Huang et al.
patent: 6978239 (2005-12-01), Chu et al.
patent: 7010489 (2006-03-01), Lewis et al.
patent: 2002/0072908 (2002-06-01), Case et al.
patent: 2002/0103648 (2002-08-01), Case et al.
patent: 2002/0152073 (2002-10-01), DeMoortel et al.
patent: 2003/0208355 (2003-11-01), Stylianou et al.
patent: 0 984 426 (2000-03-01), None
patent: 1213705 (2002-06-01), None
Huang X et al., “Recent Improvements on Microsoft's Trainable Text-To-Speech System-Whistler,” Acoustics, Speech and Signal Processing, 1997, pp. 959-962.
Hunt A et al., “Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database,” IEEE International Conference on Acoustics, Speech and Signal Processing, 1996, pp. 373-376.
Tien Ying Fung et al., “Concatenating Syllables for Response Generation in Spoken Language Applications,” IEEE International Conference on Acoustics, Speech and Signal Processing, 2000, pp. 933-936.
Fu-Chiang Chou et al., “A Chinese Text-To-Speech System Based on Part-of-Speech Analysis, Prosodic Modeling and Non-Uniform Units,” Acoustics, Speech, and Signal Processing, 1997, pp. 923-926.
Bigorgne D. et al., “Multilingual PSOLA Text-To-Speech System,” Statistical Signal and Array Processing, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1993, pp. 187-190.
Nakajima S et al., “Automatic Generation of Synthesis Units Based on Context Oriented Clustering,” International Conference on Acoustics, Speech and Signal Processing, 1988, pp. 659-662.
Black A W et al. “Optimising Selection of Units from Speech Databases for Concatenative Synthesis,” 4th European Conference on Speech Communication and Technology Eurospeech, 1995, pp. 581-584.
P.B. Mareuil and B. Soulage, “Input/output normalization and linguistic analysis for a multilingual text-to-speech Synthesis System,” Proc. of 4th ISCA workshop on speech synthesis, Scotland, 2001.
http://www.research.att.com/projects/tts/, copyright 2003.
D.H. Klatt, “The Klattalk text-to-speech conversion system,” Proc. of ICASSP '82, pp. 1589-1592, 1982.
H. Fujisaki, K. Hirose, N. Takahashi and H. Morikawa, “Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers,” Proc. of ICASSP '86, pp. 2039-2042, 1986.
K.N. Ross and M. Ostendorf, “A dynamical system model for generating fundamental frequency for speech synthesis,” IEEE transactions on speech and audio processing, vol. 7, No. 3, pp. 295-309, 1999.
J.R. Bellegarda, K. Silverman, K. Lenzo, and V. Anderson, “Statistical prosodic modeling: from corpus design to parameter estimation,” IEEE transactions on speech and audio processing, vol. 9, No. 1, pp. 52-66, 2001.
S. Chen, S. Hwang and Y. Wang, “An RNN-based prosodic information synthesizer for Mandarin text-to-speech,” IEEE transactions on speech and audio processing, vol. 6, No. 3, pp. 226-239, 1998.
M. Chu, H. Peng, H. Yang and E. Chang, “Selecting non-uniform units from a very large corpus for concatenative speech synthesizer,” Proc. of ICASSP '2001, Salt Lake City, 2001.
E. Moulines and F. Charpentier, “Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones,” Speech Communication vol. 9, pp. 453-467, 1990.
Y. Stylianou, T. Dutoit, and J. Schroeter, “Diphone concatenation using a harmonic plus noise model of speech,” Proc. of Eurospeech '97, pp. 613-616, Rhodes, 1997.
X.D. Huang, A. Acero, J. Adcock, et al., “Whistler: a trainable text-to-speech system,” Proc. of ICSLP '96, Philadelphia, 1996.
R.E. Donovan and E.M. Eide, “The IBM trainable speech synthesis system,” Proc. of ICSLP '98, Sydney, 1998.
H. Peng, Y. Zhao and M. Chu, “Perceptually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation,” Proc. of ICSLP '2002, Denver, 2002.
M. Chu and H. Peng, “An objective measure for estimating MOS of synthesized speech,” Proc. of Eurospeech '2001, Aalborg, 2001.
http://www.microsoft.com/speech/techinfo/compliance/, Apr. 3, 2002.
Wang, et al. “Tree-Based Unit Selection for English Speech Synthesis,” ICASSP '93, vol. 2, pp. 191-194 (1993).
Huang, X., Lou, Z. and Tang, J., “A Quick Method for Chinese Word Segmentation,” Intelligent Processing Systems, vol. 2, pp. 1773-1776 (1997).
Wong, P. and Chan, C., “Chinese Word Segmentation Based on Maximum Matching and Word Binding Force,” Coling'96, Copenhagen (1996).
Hon, H., Acero, A., Huang, S., Liu, J. and Plumpe, M., “Automated Generation of Synthesis Units for Trainable Text-to-Speech Systems,” ICASSP'98, vol. 1, pp. 293-296 (1998).
Black, A. and Campbell, N., “Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database,” ICASSP'96, pp. 373-376 (1996).
Chu, M., Tang, D., Si, H., Tian, Z. and Lu, S., “Research on Perception of Junction Between Syllables in Chinese,” Chinese Journal of Acoustics, vol. 17, No. 2, pp. 143-152.
M. Chu, H. Peng and E. Chang, “A concatenative Mandarin TTS system without prosody model and prosody modification”, Proceedings of 4th ISCA workshop on speech synthesis, Scotland, 2001.
Campbell, Nick: “Foreign-Language Speech Synthesis” Proceedings 3rd Esca-Cocosda Int'l Workshop In Speech Synthesis, Nov. 26, 1998, pp. 177-180.
Campbell, Nick: “Talking Foreign Concatenative Speech Synthesis and the Language Barrier”, Eurospeech 2001, vol. 1, 2001. p. 337.
Sproat et al: “Emu: an e-mail preprocessor for text-to-speech” Multimedia Signal Processing, 1998 IEEE Second Workshop on Redondo Beach, CA, Dec. 7-9, 1998.
Sproat R Ed, et al., “Multilingual text analysis for text-to-speech synthesis”, Spoken Language, 1996. ICSLP 96 Proceedings, 4th International Conference in Philadelphia, PA, Oct. 3-6, 1996, pp. 1365-1368.
Christof Traber et al., “From Multilingual To Polyglot Speech Synthesis” Eurospeech 1999, vol. 2, 1999, p. 835.
Huber, H. et al., “Possy: EIN Projekt Zur Realisierung Einer Polyglotten Spr
