Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
1998-09-16
2001-06-26
Tsang, Fan (Department: 2645)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S262000
Reexamination Certificate
active
06253172
ABSTRACT:
TECHNICAL FIELD OF THE INVENTION
This invention relates to spectral transformation of acoustic signals.
BACKGROUND OF THE INVENTION
In a number of important applications it is desirable to carry out spectral transformations on acoustical signals. In speech signal processing, the speech may be compressed or expanded in frequency. In particular, frequency compression is useful in bandwidth reduction or in placing the speech into a desired frequency range as an aid to the hearing impaired. Another speech application requires that the fundamental frequency of the speaker be modified while preserving the shape of the envelope of the short-time speech spectrum. This operation is useful in psychoacoustic research and in correcting pitch discontinuities in concatenated speech segments. In musical signal processing, in order to synthesize all individual notes across the entire range of a particular musical instrument, a common practice is to analyze some of the original notes and store their parameters. At the synthesis stage, all other notes are obtained from the analyzed notes by pitch shifting. Generally speaking, in a sampler or a wavetable synthesizer, one original sound waveform is stored for every three or four notes. The pitch shifting is accomplished by sample rate conversion. It is well known that the pitch shifting through sample rate conversion preserves the original signal waveform, but creates two undesired effects. One is that it “compresses” the signal spectrum so that the pitch-shifted signal sounds “darker”. To avoid aliasing, the pitch is always shifted down in samplers or wavetable synthesizers. The other one is that since the signal waveform shape is not changed among adjacent notes, musical sounds synthesized by a sampler or a wavetable synthesizer lack variations from note to note, and thus lack the realism of musical instruments. To improve the brightness and the realism of pitch-shifted signals, researchers are trying to use the result from speech signal analysis and synthesis, that is, trying to preserve the signal spectrum envelope when the original signal is pitch-shifted. Even though the physical reason of such use remains to be justified, it is widely accepted that the brightness of pitch-shifted signals does get improved by preserving the shape of the signal spectrum envelope.
A prior art frequency-domain approach is described by Quatieri, et al. in an article entitled, “Speech Transformations based on a Sinusoidal Representation,” IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 34, pp. 1449-1464, December 1989. Assume s(t) is the signal to be pitch-shifted by a factor &bgr;. According to Quatieri, et al., the pitch shifting or frequency transformation is performed as follows. First, a transfer function
H(&ohgr;, t)=M(&ohgr;, t) exp [j&PHgr;(&ohgr;, t)]
is obtained. (In practice, only uniform samples of H(&ohgr;, t) from the Discrete Fourier Transform (DFT) are available and stored. The magnitude response of this transfer function, H(&ohgr;, t), is a good approximation to the spectrum envelope of the signal s(t). The phase function, &PHgr;(&ohgr;, t), is the Hilbert transform of M(&ohgr;, t). So the transfer function H(&ohgr;, t) represents a minimum phase system. The socalled excitation signal e(t) can then be obtained by filtering s(t) through the inverse system of H(&ohgr;, t). The excitation signal e(t) can be expressed using a sinusoidal model as
e
⁡
(
t
)
=
∑
t
L
⁢
a
l
⁡
(
t
)
⁢
cos
⁢
[
∫
0
t
⁢
ω
l
⁡
(
σ
)
⁢
ⅆ
σ
+
η
l
]
When a pitch modification is needed, each sine-wave component of the excitation signal is scaled by a desired factor &bgr; to generate a new frequency track at &bgr;&ohgr;
l
(t). The excitation amplitude a
l
(t) is then shifted to the new frequency track location. To preserve the shape of the spectrum envelope, the amplitudes and phases of H(&ohgr;, t) must be computed at the new frequency track location &bgr;&ohgr;
l
(t). They are obtained by sampling (interpolation in frequency) M(&ohgr;, t) and &PHgr;(&ohgr;, t), respectively.
With the above modified excitation and system magnitudes and phases, the resulting modified signal waveform, denoted as {tilde over (s)}(t, &bgr;), is given by
s
~
⁡
(
t
,
β
)
=
∑
l
L
⁢
a
l
⁡
(
t
)
⁢
M
⁡
(
β
⁢
⁢
ω
l
,
t
)
⁢
cos
⁢
{
∫
0
t
⁢
β
⁢
⁢
ω
l
⁡
(
σ
)
⁢
ⅆ
σ
+
η
l
+
Φ
⁡
(
βω
l
,
t
)
}
.
It is not difficult to see that this frequency domain approach requires a large amount of memory (to store the samples of M(&ohgr;, t) and &PHgr;(&ohgr;, t), and computations (to obtain the system magnitudes and phases at new frequency track location.)
SUMMARY OF THE INVENTION
In accordance with one embodiment of the present invention, an improved method of pitch modification or frequency transformation includes the steps of getting the desired spectrum envelope, an approximation of the spectrum envelope of frequency scaled signal whitening or flattening of the spectrum envelope of the frequency scaled signal and applying back the desired spectrum envelope to the whitened frequency scaled signal.
These and other features of the invention will be apparent to those skilled in the art from the following detailed description of the invention, taken together with the accompanying drawings.
REFERENCES:
patent: 5233659 (1993-08-01), Ahlberg
patent: 5642465 (1997-06-01), Scott et al.
patent: 5884251 (1999-03-01), Kim et al.
patent: 5903866 (1999-05-01), Shoham
patent: 6104992 (2000-08-01), Gao et al.
Ding Yinong
McCree Alan V.
Yim Susan
Opsasnick Michael N.
Telecky , Jr. Frederick J.
Texas Instruments Incorporated
Troike Robert L.
Tsang Fan
LandOfFree
Spectral transformation of acoustic signals does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Spectral transformation of acoustic signals, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Spectral transformation of acoustic signals will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2532461