Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Patent
1997-09-09
2000-06-27
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704269, G10L 1302
Patent
active
06081781&
ABSTRACT:
Data in the same range of the fundamental frequency F.sub.0 as speech segments are used as learning data to prepare a reference codebook CB.sub.M for a spectrum envelope. The same learning data for a higher range than F.sub.0 and the same learning data for a lower range are subject to a linear stretch matching with respect to the learning data for the range F.sub.0. For each vector code in the reference codebook CB.sub.M, the spectrum envelope is clustered to prepare a high range codebook CB.sub.H and a low range codebook CB.sub.L. The spectrum envelope of input speech segments are fuzzy vector quantized (S402) with the reference codebook, and depending on the synthesized F.sub.0, a high, middle or low codebooks is selected. The selected codebook is used to decode the fuzzy vector quantized code, and the decoded output is subject to the inverse FFT. Alternatively, codebooks CM.sub.MH and CB.sub.ML each comprising differential vectors for corresponding code vectors between CB.sub.M and CB.sub.H and between CB.sub.M and CB.sub.L are prepared. The quantized code is decoded using either CB.sub.MH or CB.sub.ML, and the decoded differential vector is stretched in accordance with a difference in the fundamental frequency between the synthesized speech and the original speech for CB.sub.M. The stretched differential vector is added to the code vector which was used for the fuzzy vector quantization.
REFERENCES:
patent: 5077798 (1991-12-01), Ichikawa et al.
patent: 5151968 (1992-09-01), Tanaka et al.
patent: 5231671 (1993-07-01), Gibson et al.
patent: 5327521 (1994-07-01), Savic et al.
patent: 5384891 (1995-01-01), Asakawa et al.
patent: 5428708 (1995-06-01), Gibson et al.
patent: 5641926 (1997-06-01), Gibson et al.
patent: 5717819 (1998-02-01), Emeott et al.
patent: 5740320 (1998-04-01), Itoh
patent: 5745650 (1998-04-01), Otsuka et al.
Asakawa et al., "A 2.4 KBPS speech coding method based on fuzzy vector quantization," 1990 International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 673-676, Apr. 1990.
Asakawa et al., "Speech coding method using fuzzy vector quantization," 1989 International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 755-758, Apr. 1989.
Tanaka, K. and Abe, M., "A New Fundamental Frequency Modification Algorithm with Transformation of Spectrum Envelope According to F.sub.0," IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, Apr. 21-24, 1997, pp. 951-954.
Valbret, H., et al., "Voice Transformation using PSOLA Technique," Speech Communication, vol. 11, Nos. 2/3, Jun. 1992, pp. 175-187.
Abe, M., et al., "Voice Conversion Through Vector Quantization," IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, Apr. 11-14, 1998, pp. 655-658.
Shikano, K., et al., "Speaker Adaptation and Voice Conversion by Codebook Mapping," IEEE International Symposium on Circuits and Systems, vol. 1, Jun. 11-14, 1991, pp. 594-597.
Yoshida, Y., and Abe, M., "An Algorithm to Reconstruct Wideband Speech from Narrowband Speech Based on Codebook Mapping," Proceedings of the International Conference on Spoken Language Processing, Sep. 18, 1994, pp. 1591-1594.
Matsumoto, H. and Inoue, H., "A Minimum Distortion Spectral Mapping Applied to Voice Quality Conversion," Proceedings of the International Conference on Spoken Language Processing, Nov. 18, 1990, pp. 161-164.
Abe Masanobu
Tanaka Kimihito
Hudspeth David R.
Lerner Martin
Nippon Telegragh and Telephone Corporation
LandOfFree
Method and apparatus for speech synthesis and program recorded m does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for speech synthesis and program recorded m, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for speech synthesis and program recorded m will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1792475