Patent
1997-12-18
2000-05-16
Zele, Krista
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
G10L 13/08
active
060649607
ABSTRACT:
A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model. An inverse of the non-exponential functional transformation is applied to duration observations, or training data. Coefficients are generated for use with the generalized additive model. The generalized additive model comprising the coefficients is applied to at least one phoneme of the received text resulting in the generation of at least one phoneme having a duration. An acoustic sequence is generated comprising speech signals that are representative of the received text.
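The pipeline in the abstract (bounded transformation, additive model over contextual factors, coefficient fitting, mapping back to a duration) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent text: the exact shape of the root sinusoidal function and its exponents `alpha` and `beta`, the function names, the two contextual factors, the toy durations in milliseconds, and the use of simple backfitting to obtain the coefficients. The model is purely additive, the simplest instance of a sum-of-products form, and which direction is called "the transformation" versus "its inverse" is a naming choice made here.

```python
import math

def to_unit(d, d_min, d_max, alpha=1.0, beta=1.0):
    """Map a duration in [d_min, d_max] onto [0, 1] (assumed root sinusoidal shape)."""
    u = (d - d_min) / (d_max - d_min)
    u = min(max(u, 0.0), 1.0)
    return math.sin(0.5 * math.pi * u ** alpha) ** beta

def from_unit(y, d_min, d_max, alpha=1.0, beta=1.0):
    """Inverse of to_unit: map a model output back to a duration in [d_min, d_max]."""
    y = min(max(y, 0.0), 1.0)  # clipping keeps the result inside the observed range
    u = (2.0 / math.pi * math.asin(y ** (1.0 / beta))) ** (1.0 / alpha)
    return d_min + (d_max - d_min) * u

def fit_additive(rows, targets, n_iter=50):
    """Backfit an intercept plus one coefficient per (factor index, level)."""
    intercept = sum(targets) / len(targets)
    coef = {}
    n_factors = len(rows[0])
    for _ in range(n_iter):
        for f in range(n_factors):
            sums, counts = {}, {}
            for row, t in zip(rows, targets):
                # Residual with factor f's own contribution left out.
                others = sum(coef.get((g, row[g]), 0.0)
                             for g in range(n_factors) if g != f)
                r = t - intercept - others
                sums[row[f]] = sums.get(row[f], 0.0) + r
                counts[row[f]] = counts.get(row[f], 0) + 1
            for level in sums:
                coef[(f, level)] = sums[level] / counts[level]
    return intercept, coef

def predict_duration(row, intercept, coef, d_min, d_max):
    """Sum the factor coefficients, then map back to a bounded duration."""
    y = intercept + sum(coef.get((f, lvl), 0.0) for f, lvl in enumerate(row))
    return from_unit(y, d_min, d_max)

# Hypothetical training data: (stress, position) -> observed duration in ms.
rows = [("stressed", "final"), ("stressed", "medial"),
        ("unstressed", "final"), ("unstressed", "medial")]
durations = [140.0, 110.0, 90.0, 60.0]

# The bounds come from the training data, as the abstract describes.
d_min, d_max = min(durations), max(durations)

targets = [to_unit(d, d_min, d_max) for d in durations]
intercept, coef = fit_additive(rows, targets)

long_ms = predict_duration(("stressed", "final"), intercept, coef, d_min, d_max)
short_ms = predict_duration(("unstressed", "medial"), intercept, coef, d_min, d_max)
print(long_ms, short_ms)
```

Because the mapping back to milliseconds is bounded, a prediction can never fall below the shortest or above the longest duration seen in training, which is one practical reason to prefer a bounded transformation over an unbounded one.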
REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4278838 (1981-07-01), Antonov
patent: 4896359 (1990-01-01), Yamamoto et al.
patent: 5400434 (1995-03-01), Pearson
patent: 5477448 (1995-12-01), Golding et al.
patent: 5485372 (1996-01-01), Golding et al.
patent: 5521816 (1996-05-01), Roche et al.
patent: 5535121 (1996-07-01), Roche et al.
patent: 5536902 (1996-07-01), Serra et al.
patent: 5537317 (1996-07-01), Schabes et al.
patent: 5617507 (1997-04-01), Lee et al.
patent: 5621859 (1997-04-01), Schwartz et al.
patent: 5729694 (1998-03-01), Holzrichter et al.
patent: 5799269 (1998-08-01), Schabes et al.
patent: 5799276 (1998-08-01), Komissarchik et al.
Harris, "On the Use of Windows for Harmonic Analysis with the DFT", Proceedings of the IEEE, vol. 66, #1, Jan. 1978.
Bellegarda Jerome R.
Silverman Kim
Apple Computer Inc.
Opsasnick Michael N.
Zele Krista