Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Reexamination Certificate
1998-10-20
2001-09-11
Dorvil, Richemond (Department: 2741)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
C704S266000, C704S208000
Reexamination Certificate
active
06289311
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a method of, and an apparatus for, synthesizing a sound from coded parameters sent from a transmitter, and also to a method of, and an apparatus for, expanding the band of a narrow frequency-band sound or speech signal transmitted to a receiver from the transmitter over a communications network such as a telephone line or broadcasting network, while keeping the frequency band unchanged over the transmission path.
2. Description of Related Art
The telephone lines are regulated to use a frequency band as narrow as 300 to 3,400 Hz, for example, and the frequency band of a sound signal transmitted over the telephone network is thus limited. Therefore, the conventional analog telephone line may not be said to assure a good sound quality. This is also true for the digital portable telephone.
However, since the standards, regulations and rules for the telephone transmission path are already strictly defined, it is difficult to expand the frequency band for such specific communications. In these situations, there have been proposed various approaches to generate a wide-band signal by predicting out-of-band signal components at the receiver. Among such technical proposals, an approach to overcome such a difficulty by using a sound code book mapping is considered the best for a good sound quality. This approach is characterized by that two sound code books for sound analysis and synthesis are used to predict a spectrum envelope of a wide-band sound from a one of a narrow-band sound supplied to the receiver.
More particularly, the above approach uses the Linear Predictive Code (LPC) cepstrum, a well-known parameter for representation of a spectrum envelope, to pre-form two sound code books, one for a narrow-band sound and the other for a wide-band sound. There exist one-to-one correspondences between code vectors in these two sound code books. A narrow-band LPC cepstrum is determined from an input narrow-band sound, quantized in vector by comparison with a code vector in the narrow-band sound code book, and dequantized using a corresponding code vector in the wide-band sound code book, to thereby determine a wide-band LPC cepstrum.
For the one-to-one correspondence between the code vectors, the two sound code books are generated as will be described below. First, a wide-band learning sound is prepared, and it is limited in bandwidth to provide a narrow-band learning sound as well. The wide- and narrow-band learning sounds thus prepared are framed, respectively, and an LPC cepstrum determined from the narrow-band sound is used to first learn and generate a narrow-band sound code book. Then, frames of a learning wide-band sound corresponding to the resultant learning narrow-band sound frames to be quantized to a code vector are collected, and weighted to provide wide-band code vectors from which a wide-band sound code book is formed.
As another application of this approach, a wide-band sound code book may first be generated from the learning wide-band sound, and then corresponding learning narrow-band sound frames are weighted to provide narrow-band code vectors from which a narrow-band sound code book is generated.
Further, there has also been proposed a sound code book generation mode in which an autocorrelation is used as a parameter to be a code vector. Also, innovations are requisite for the LPC analysis and synthesis. Such innovations include a set of an impulse train and noise, an upsampled narrow-band innovation, etc.
The application of the aforementioned approaches have not succeeded in attaining a satisfactory sound quality. In particular, the sound quality is remarkably poor when the approach is applied for a sound encoded in the low bit rate sound encoding mode such as the Vector Sum Excited Linear Prediction (VSELP) mode, Pitch Synchronous Innovation-Code Excited Linear Prediction (PSI-CELP) mode or the like included in the so-called sound encoding mode CELP (Code Excited Linear Prediction) adopted in the digital telephone systems currently prevailing in Japan.
Also, the size of the memory used in generating the narrow- and wide-band sound code books is insufficient.
SUMMARY OF THE INVENTION
Accordingly, the present invention has an object to overcome the above-mentioned drawbacks of the prior art by providing a sound synthesizing method and apparatus, and a band expanding method and apparatus, adapted to provide a wide-band sound having a good quality for hearing.
To overcome the above-mentioned drawbacks of the prior art, the present invention has another object to provide a sound synthesizing method and apparatus, and a band expanding method and apparatus, adapted to save the memory capacity by using a sound code book for both sound analysis and synthesis.
The above object can be achieved by providing a sound synthesizing method in which, to synthesize a sound from plural kinds of input coded parameters, there are adopted a wide-band voiced sound code book and a wide-band unvoiced sound code book pre-formed from voiced and unvoiced sound characteristic parameters, respectively, extracted from wide-band voiced and unvoiced sounds separated at every predetermined time unit, and a narrow-band voiced sound code book and a narrow-band unvoiced sound code book pre-formed from voiced and unvoiced sound characteristic parameters extracted from a narrow-band sound obtained by limiting the frequency band of the separated wide-band voiced and unvoiced sounds, comprising, according to the present invention, the steps of
decoding the plural kinds of coded parameters;
forming an innovation from a first one of the plural kinds of decoded parameters;
converting a second decoded parameter to a sound synthesis characteristic parameter;
discriminating between the voiced and unvoiced sounds discriminable with reference to a third decoded parameter;
quantizing the sound synthesis characteristic parameter based on the result of the discrimination by using the narrow-band voiced and unvoiced sound code books;
dequantizing, by using the wide-band voiced and unvoiced sound code books, the narrow-band voiced and unvoiced sound data having been quantized using the narrow-band voiced and unvoiced sound code books; and
synthesizing a sound based on the dequantized data and innovation.
The above object can also be achieved by providing a sound synthesizing apparatus which uses, to synthesize a sound from plural kinds of input coded parameters, a wide-band voiced sound code book and a wide-band unvoiced sound code book pre-formed from voiced and unvoiced sound characteristic parameters, respectively, extracted from wide-band voiced and unvoiced sounds separated at every predetermined time unit, a narrow-band voiced sound code book and a narrow-band unvoiced sound code book pre-formed from voiced and unvoiced sound characteristic parameters extracted from a narrow-band sound obtained by limiting the frequency band of the separated wide-band voiced and unvoiced sounds, comprising, according to the present invention:
means for decoding the plural kinds of coded parameters;
means for forming an innovation from a first one of the plural kinds of parameters decoded by the decoding means;
means for obtaining a sound synthesis characteristic parameter from a second one of the coded parameters decoded by the decoding means;
means for discriminating between the voiced and unvoiced sounds with reference to a third one of the coded parameters decoded by the decoding means;
means for quantizing the sound synthesis characteristic parameter based on the result of the discrimination of the voiced and unvoiced sounds by using the narrow-band voiced and unvoiced sound code books;
means for dequantizing the quantized voiced and unvoiced sound data from the voiced and unvoiced sound quantizing means by using the wide-band voiced and unvoiced sound code books; and
means for synthesizing a sound based on the dequantized data from the wide-band voiced and unvoiced sound dequantizing means and the innovation from the innovation forming means
Nishiguchi Masayuki
Omori Shiro
Dorvil Richemond
Maioli Jay H.
Sony Corporation
LandOfFree
Sound synthesizing method and apparatus, and sound band... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Sound synthesizing method and apparatus, and sound band..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Sound synthesizing method and apparatus, and sound band... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2435919