Speech synthesis using perceptual linear prediction parameters


Patent

Details

381 51, 381 53, 381 36, G10L 502, G10L 910, G10L 500

Patent

active

051650088

ABSTRACT:
A method for synthesizing human speech using a linear mapping of a small set of speaker-independent coefficients. Preferably, the speaker-independent coefficients are cepstral coefficients developed during a training session using a perceptual linear predictive (PLP) analysis. A linear predictive all-pole model is used to develop the corresponding formants and bandwidths, to which the cepstral coefficients are mapped using a separate multiple regression model for each of the five formant frequencies and five formant bandwidths. This dual analysis (PLP and all-pole) produces both the cepstral coefficients of the PLP model for the different vowel-like sounds and their true formant frequencies and bandwidths. The separate multiple regression models, developed by mapping the cepstral coefficients into the formant frequencies and formant bandwidths, can then be applied to cepstral coefficients determined for subsequent speech to produce the corresponding formants and bandwidths used to synthesize that speech. Because less data is required to synthesize each speech segment than in conventional techniques, the storage space and/or transmission rate required for the synthesis data is reduced. In addition, the cepstral coefficients for each speech segment can be used with the regression models of a different speaker to produce synthesized speech corresponding to that different speaker.
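The mapping step described in the abstract can be illustrated with a minimal sketch: one least-squares regression per target, ten in all (five formant frequencies and five formant bandwidths), each taking the cepstral coefficients of a frame as input. Everything below is an assumption made for illustration and is not taken from the patent: the number of cepstral coefficients (`N_CEP`), the synthetic training data, the function names, and the use of ordinary least squares solved through the normal equations.

```python
import random

def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_regression(cepstra, targets):
    """Least-squares weights (intercept first) for ONE target parameter."""
    rows = [[1.0] + list(c) for c in cepstra]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)

def predict(weights, cepstrum):
    """Apply one regression model to the cepstral coefficients of a frame."""
    return weights[0] + sum(w * c for w, c in zip(weights[1:], cepstrum))

random.seed(0)
N_CEP = 5       # assumed number of cepstral coefficients per frame
N_TARGETS = 10  # five formant frequencies + five formant bandwidths
N_FRAMES = 40   # assumed number of training frames

# Synthetic stand-ins for the training data: in the abstract, the cepstra come
# from the PLP analysis and the targets from the all-pole analysis.
true_w = [[random.uniform(-1, 1) for _ in range(N_CEP + 1)]
          for _ in range(N_TARGETS)]
train_cep = [[random.gauss(0, 1) for _ in range(N_CEP)]
             for _ in range(N_FRAMES)]
train_targets = [[predict(w, c) for c in train_cep] for w in true_w]

# Training: a separate regression model for each of the ten targets.
models = [fit_regression(train_cep, t) for t in train_targets]

# Synthesis side: only the cepstral coefficients of a new frame are needed;
# the ten formant parameters are regenerated from the stored models.
new_cep = [random.gauss(0, 1) for _ in range(N_CEP)]
estimated = [predict(w, new_cep) for w in models]
expected = [predict(w, new_cep) for w in true_w]
max_error = max(abs(e - x) for e, x in zip(estimated, expected))
```

In this sketch each frame is represented by `N_CEP` numbers rather than ten formant parameters, which is the kind of data reduction the abstract refers to. Swapping `models` for a set fitted on another speaker's training data would correspond to the speaker-conversion use mentioned in the last sentence of the abstract.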

REFERENCES:
patent: 4051331 (1977-09-01), Strong et al.
patent: 4130730 (1978-12-01), Ostrowski
patent: 4763278 (1988-08-01), Rajasekaran et al.
patent: 4829573 (1989-05-01), Gagnon et al.
patent: 4882758 (1989-11-01), Uekawa et al.
patent: 4908865 (1990-03-01), Doddington et al.
patent: 4914702 (1990-04-01), Taguchi
Makhoul, John, Linear Prediction: A Tutorial Review, Reprinted from Proc. of IEEE, vol. 63, Apr. 1975, May 17, 1988.
Chandra et al., Linear Prediction with a Variable Analysis Frame Size, IEEE Trans. on ASSP, Aug. 1977.
Broad, David J., et al., Formant Estimation by Linear Transformation of the LPC Cepstrum, Reprinted from The Journal of the Acoustical Society of America, vol. 86, No. 5, Nov. 1989, pp. 2013-2017.
Hermansky, H., Perceptual Linear Predictive (PLP) Analysis of Speech, J. Acoust. Soc. Am. 87(4), Apr. 1990, copyright 1990, Acoustical Society of America, pp. 1738-1752.
Hermansky, H., et al., The Effective Second Formant F2' and the Vocal Tract Front-Cavity, ICASSP-89, Glasgow, Scotland, CH2673-Feb. 1989, copyright 1989 IEEE, pp. 480-483.
