Sound reproducing speed converter

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704262, G10L 1302

Patent

active

061156876

DESCRIPTION:

BRIEF SUMMARY
TECHNICAL FIELD

The present invention relates to an apparatus for converting a voice reproducing rate to reproduce digitized voice signals at an arbitrary rate without transforming (changing) a pitch of voice.
In this specification(description), "voice" and "voice signal" are used to represent all acoustic signals generated from instruments and others, not only voice uttered from a person.


BACKGROUND ART

As a method to convert a reproducing rate into an arbitrary rate without transforming a pitch of voice, PICOLA (Pointer Interval Control Overlap and Add) method is known. The principle of PICOLA method is introduced by "Time-Scale Modification Algorithm for Speech by Use of Pointer Interval Control Overlap and Add (PICOLA) and Its Evaluation" written by MORITA, Naotaka and ITAKURA, Fumitada in Proceeding of National Meeting of The Acoustic Society of Japan 1-4-14 (October, 1986).
And, the application of PICOLA method for voice signals divided into frames to convert a reproducing rate with fewer buffer memories is disclosed in Japanese unexamined patent publication No.8-137491.
FIG. 9 illustrates a block diagram of a conventional apparatus for converting a voice reproducing rate in PICOLA method. In the apparatus for converting a voice reproducing rate illustrated in FIG. 9, digitized voice signals are recorded in recording media 1, and framing section 2 fetches a voice signal in a frame of a predetermined length LF sample from recording media 1. The voice signal fetched by framing section 2 is provided into pitch period calculating section 6 along with stored in buffer memory 3 temporarily. Pitch period calculating section 6 calculates pitch period Tp of the voice signal to provide it into waveform overlapping section 4 along with storing a pointer of processing start position into buffer memory 3. Waveform overlapping section 4 overlaps waveforms of voice signals stored in buffer memory 3 using the pitch period of the input voice, then outputs the overlapped waveform into waveform synthesizing section 5. Waveform synthesizing section 5 synthesizes an output voice signal waveform from the voice signal waveform stored in buffer memory 3 and the overlapped waveform processed at waveform overlapping section 4 to provide the output voice.
In this apparatus for converting a voice reproducing rate, a reproducing rate is converted without transforming a pitch according to the process in the following.
First, a processing method for high rate reproducing is explained with FIG. 10 and FIG. 11. In the figures, P0 is a pointer indicating a head of a waveform overlap processing frame. In the waveform overlap processing, a processing frame is a LW sample with a length of two periods of voice pitch period Tp. And, when a rate of input voice is 1 and a desired reproducing rate is given r, L is the number of samples given by the following formulation. input voice of Tp+L sample is reproduced as an output voice of L sample as mentioned later. Accordingly, r=(Tp+L)/L is given, then the formulation (1) is introduced.
An input voice fetched from recording media 1 by framing section 2 is stored in buffer memory 3. Concurrently, pitch period calculating section 6 calculates pitch period Tp of the input voice to input it to waveform overlapping section 4. And, pitch period calculating section 6 calculates L from pitch period Tp using the formulation (1), determines P0' that is a starting position for next processing and provides it into buffer memory 3 as a pointer in the buffer memory.
Waveform overlapping section 4 fetches a waveform of waveform overlap processing frame LW (=2Tp) sample from a processing starting point indicated by pointer P0 from buffer memory 3, decreases the first part of the processing frame (waveform A) in the time axis direction and increases the latter part of the processing frame (waveform B) in the time axis direction according to the the triangle window function, adds waveform A and waveform B, then calculates overlapped waveform c.
Waveform synthesizing section 5 removes the waveform of the wave

REFERENCES:
patent: 4577343 (1986-03-01), Oura
patent: 4937868 (1990-06-01), Taguchi
patent: 5369730 (1994-11-01), Yajima
patent: 5630013 (1997-05-01), Suzuki et al.
patent: 5765127 (1998-06-01), Nishiguchi et al.
patent: 5832437 (1998-11-01), Nishiguchi et al.
patent: 5847303 (1998-12-01), Matsumoto
patent: 5950152 (1999-09-01), Arai et al.
patent: 5991724 (1999-09-01), Kojima et al.
patent: 5991725 (1999-09-01), Asghar et al.
An article by Morita et al., entitled "Time-Scale Modification Algorithm For Speech By Use of Pointer Interval Control Overlap and Add (PICOLA) and its Evaluation", Proceeding of National Meeting of the Acoustic Society of Japan, 1-4-14, Oct. 1986.
An English Language abstract of Morita et al. article.
An English Language abstract of JP 7-077999.
An English Language abstract of JP 1-267700.
An English Language abstract of JP 9-152889.
An English Language abstract of JP 8-137491.
An English Language abstract of JP 8-202397.
An English Language abstract of JP8-022300.
An English Language abstract of JP 7-319496.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Sound reproducing speed converter does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Sound reproducing speed converter, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Sound reproducing speed converter will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2222609

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.