Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
1999-10-15
2004-02-03
Chawan, Vijay (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S501000, C704S207000, C704S200100, C704S258000, C707S793000, C345S215000
Reexamination Certificate
active
06687664
ABSTRACT:
BACKGROUND OF THE INVENTION
Scrubbing systems are used in many digital audio workstations (DAW). These systems have their origin in analog tape playback systems where a location on an analog tape audio recording could be located by “scrubbing” the tape back and forth across the play head of the playback device thus causing playback in the speed and direction of movement of the tape. As known in the art, “digital audio scrubbers” are systems in which the user scans portions of an audio recording with an input device, which results in the audio playback of the scanned portion; the instantaneous playback position of the audio tracks the position of the user's input device. The system is typically used to locate splice points or audio artifacts in the program.
DAWs often have two methods of scrubbing. The first method allows the user to control the instantaneous playback position of the audio data. The second method allows the user to control the playback rate and direction of the audio data. In the first method a plot of an audio waveform is displayed and the user drags a mouse or other input device that directs a control icon on the display back and forth over a portion of the waveform to be played. As the control icon moves it directs the instantaneous playback position of the audio to be played. The rate of change of position of the control icon thus ultimately directs the audio playback speed and direction. If the user scrubs the mouse from left to right the audio will play back in the forward direction. Likewise, a mouse movement from right to left will result in reverse playback. If the user stops moving the mouse the audio is frozen in the current location. Scrubbing is activated either by holding down a key, or a mouse button, or it is toggled on and off by clicking a mouse button or with a key press.
In a second method a “jog-wheel” is used. The “jog-wheel” can be a physical input device connected to the scrubbing system or it can be a virtual input device, such as a slider, on the graphical display and controlled with a mouse. The “jog-wheel” is moved in one direction to start forward playback and the opposite direction to start reverse playback. When the “jog-wheel” is released it returns to center automatically and playback stops. The playback speed is controlled by the amount the “jog-wheel” is moved from its resting position. In both methods of scrubbing as playback occurs a visual indication of the playing audio is shown. Often a cursor in the form of a simple line is moved over the audio waveform.
Typical audio-visual scrubbing systems use sample rate conversion to adjust the speed of the audio playback. When scrubbing in the mode that controls speed and direction directly this is fairly straightforward. When scrubbing in the mode that controls instantaneous playback position the speed is constantly adjusted to try and track the playback position indicated from the user. Using sample rate conversion offers two disadvantages: 1) The playback pitch is shifted proportionately to the playback speed. At very slow and fast playback speeds the audio will sound quite differently from the original. Also, when the user stops moving the input device the audio will be muted. 2) Many systems have a large output latency, which result in a system that is difficult to control.
It is desired to have a system where 1) playback speed can be controlled independently of pitch, 2) synchronization between audio playback and the user's input device can be obtained, and 3) it is possible to for the user to hold the input device at one position in the audio waveform and have the audio at that position sustain playback.
SUMMARY OF THE INVENTION
According to one aspect of the invention, an audio scrubber GUI includes a representation of a media file, a control icon, and a user input device. An audio system utilizes a phase-vocoder to implement playback of a portion of the media file indicated by the control icon. A user input device is used to manipulate the control icon to indicate the instantaneous position, or equivalently the direction and speed of playback of the media file. The phase-vocoder allows the playback rate to be varied while preserving pitch and also allows for pitch modification independent from the playback rate.
According to another aspect of the invention, the audio system synchronizes the playback of the media file to the asynchronous clock output by the audio scrubber system. For this aspect the instantaneous position of the input device is periodically translated to a playback media time. This playback media time can be viewed as a clock signal to synchronize audio playback with.
According to another aspect of the invention, the media file is analyzed in real time to facilitate real time playback in response to manipulations of the control icon.
According to another aspect of the invention, a specified motion of the control icon can cause pitch shifting independent of playback rate or if playback is paused.
REFERENCES:
patent: 5600775 (1997-02-01), King et al.
patent: 5826102 (1998-10-01), Escobar et al.
patent: 6262724 (2001-07-01), Crow et al.
patent: 6526325 (2003-02-01), Sussman et al.
Laroche et al., (“Improved Phase Vocoder Time-Scale modification of Audio”, IEEE transactions on Speech and Audio processing, May 1999, vol. 7, issue 3, pp. 323-332).*
Cox et al., (“Low Bit-Rate Speech Coders for Multimedia Communication”, IEEE Communications Magazine, vol. 34, Issue 41, pp. 34-41, Dec. 1996).*
Laroche et al., (“New Phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects”, 1999 Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 91-94).*
Laroche et al., (“Phase-vocoder: about this phasiness business”, 1997 IEEE ASSP Workshop on Applications of Signal Processing Audio and Acoustics, pp. 19-22).*
Sylvestre et al., (“Time-scale modification of speech using an incremental time-frequency approach with waveform structure compensation”, ICASSP-92, 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1992, vol. 1, pp. 81-84).*
Quatieri et al., (“Shape invariant time-scale and pitch modification of speech”, IEEE Transactions on Signal Processing, vol. 40 Issue 3, pp. 497-510).
Dolson Mark
Laroche Jean
Sussman Robert
Chawan Vijay
Creative Technology Ltd.
LandOfFree
Audio-visual scrubbing system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio-visual scrubbing system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio-visual scrubbing system will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3298196