Phase-vocoder pitch-shifting

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S205000, C704S229000, C704S269000, C704S203000

Reexamination Certificate

active

06549884

ABSTRACT:

FIELD OF THE INVENTION
This invention relates generally to the field of signal processing, and more particularly, to a method and apparatus for pitch-shifting an information signal.
BACKGROUND OF THE INVENTION
Pitch-shifting is the operation whereby the pitch of a signal (music, speech, audio or other information signal), is altered while its duration remains unchanged. Pitch shifting may be used in audio processing, such as in music synthesis, where the original pitch of musical sounds of a known duration may be shifted to form higher or lower pitched sounds of the same duration. For example, pitch-shifting can be used to transpose a song between keys or to change the sound of a person's voice to achieve a desired special effect.
Typically, use of a phase-vocoder has always been a highly praised technique for time-scale modification of speech and audio signals. This is because the resulting signal is usually free of artifacts typically encountered in other time domain techniques. The standard way to carry out pitch-shifting using the phase-vocoder is to first perform a time-scale modification, then perform a time-domain sample rate conversion to obtain the resulting signal. For example, in order to raise the pitch of a signal by a factor of two while keeping its duration unchanged, one would use the phase-vocoder to time-expand the signal by a factor of two, leaving the pitch unchanged, and then down-sample the resulting signal by a factor of two, thereby restoring the original duration.
Unfortunately, using a phase-vocoder to perform pitch-shifting has several undesirable drawbacks. One drawback is that the processing cost per output sample is a function of the pitch modification factor. For example, if the modification factor is large, the number of mathematical operations increases correspondingly. The mathematical operations may also require complex functions, such as computing arctangents or phase unwrapping. Another drawback is that only one ‘linear’ pitch-shift modification can be performed at a time. This is true because the frequencies of all the components are multiplied by the same modification factor. As a result, more complex processes, like signal harmonizing or chorusing, cannot be implemented in one pass and therefore have high processing costs.
Given the limitations of the phase-vocoder, it is desirable to have a system that can perform processes like pitch-shifting in a computationally efficient manner. Such a system should also be capable of performing a variety of linear and non-linear pitch-shifting functions in a single pass. In doing so, special effects such as harmonizing and chorusing could be efficiently and easily implemented.
SUMMARY OF THE INVENTION
One aspect of the present invention solves the problems associated with pitch-shifting by providing a system for pitch-shifting signals in the frequency domain. This eliminates the expensive time domain resampling stage and allows the computational costs to become independent of the pitch modification factor. Unlike the prior art, the system does not require the calculation of arctangents nor phase unwrapping when modifying the phase in the frequency domain, thus achieving a significant reduction in the number of computations. For example, in one embodiment, the system supports a 50% overlap (as opposed to a 75% overlap in standard implementations), which cuts the computational cost by a factor of 2.
In an embodiment of the invention, a method is provided for pitch-shifting a signal by converting the signal to a frequency domain representation and then identifying a region in the frequency domain representation. The region being located at a first frequency location. Next, the region is shifted to a second frequency location to form a adjusted frequency domain representation. Finally, the adjusted frequency domain representation is transformed to a time domain signal representing the input signal with shifted pitch.


REFERENCES:
patent: 5384891 (1995-01-01), Asakawa et al.
patent: 5567901 (1996-10-01), Gibson et al.
patent: 5687240 (1997-11-01), Yoshida et al.
patent: 5870704 (1999-02-01), Laroche
patent: 5890108 (1999-03-01), Yeldener et al.
patent: 6073100 (2000-06-01), Goodridge, Jr.
patent: 6112169 (2000-08-01), Dolson
patent: 6182042 (2001-01-01), Peevers
Sylvestre et al., (“Time-scale Modification of Speech Using Incremental Time-Frequency Approach with Waveform Structure Compensation,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar. 23-26, 1992, pp. 81-84).*
Laroche et al., (“Phase vocoder : about this phasiness business,” 1997 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 1-4, Oct. 1997).*
Laroche et al., (“Improved phase vocoder time-scale modification of audio,” IEEE Transactions on Speech and Audio Processing, vol. 7, issue 3, pp. 323-332, may 1999).*
Allen et al. “A Unified Approach to Short-Time Fourier Analysis and Synthesis,” Proc. IEEE 65:1558-1564 (1977).
Bershad “Analysis of the Normalized LMS Algorithm with Gaussian Inputs,” IEEE Transactions on Acoustics, Speech, and Signal Processing 34:793-806 (1986).
Ferreira “An odd-DFT based approach to time-scale expansion of audio signals,” IEEE Transactions on Speech and Audio Processing.7:441-453 (1999).
Flanagan et al. “Phase vocoder,” Bell Syst. Tech. J. 45:1493-1509 (1966).
George et al. “Analysis-By-Synthesis/Overlap-Add Sinusoidal Modeling Applied to the Analysis and Synthesis of Musical Tones,” J. Audio Eng. Soc. 40:497-516 (1992).
Laakso et al. “Splitting the Unit Delay,” IEEE Signal Processing Mag., 13:30-60 (1996).
Laroche “Time and pitch scale modification of audio signals,” inApplications of Digital Signal Processing to Audio and Acoustics, M. Kahrs and K. Brandenburg eds., Kluwer, Norwell, MA, (1998).
Marques et al. “Harmonic Coding at 4.8 KB/S,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing 1:17-20, (1990).
Moulines et al. “Non parametric techniques for pitch-scale and time-scale modification of speech,” Speech Communication 16:175-205 (1995).
Portnoff “Time-scale modifications of speech based on short-time Fourier analysis,” IEEE Trans. Acoust., Speech, Signal Processing 29:374-390 (1981).
Puckette “Phase-locked vocoder” Proc. Proc. IEEE ASSP Workshop on App. of Sig. Proc. to Audio and Acous., New Paltz, NY (1995).
Putnam et al. “Design of Fractional Delay Filters Using Convex Optimization,” Proc. IEEE ASSP Workshop on App. of Sig. Proc. to Audio and Acous., New Paltz, NY (1997).
Serra et al. “Spectral Modeling Synthesis: a Sound Analysis/Synthesis System Based on a Deterministic Plus Stochastic Decomposition,” Computer Music J. 14:12-24 (1990).
Smith et al. “A flexible Sampling-Rate Conversion Method,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, San Diego, CA, Mar. 1984.
Valimaki et al. “Fractional Delay Digital Filters” Proc. IEEE Int. Symposium on Circuits and Systems, Chicago, IL (1993).
Williamson et al. “Fir Approximation of Fractional Sample Delay Systems,” IEEE Trans. Circuit and Syst.-II 43:269-271 (1996).
Almeida, et al., “Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 27.5.1-27.5.4 (1984).
McAulay, et al., “Speech Analysis/Sythesis Based on a Sinusoidal Representation,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, No. 4, pp. 744-754 (1986).
Tassart et al., “Analytical Approximations of Fractional Delays: Lagrange Interpolators and Allpass Filters,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Munich, Germany (1997).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Phase-vocoder pitch-shifting does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Phase-vocoder pitch-shifting, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Phase-vocoder pitch-shifting will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3094275

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.