Method for synthesizing voiceless consonants

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Details

704258, G10L 1306

Patent

active

061121789

DESCRIPTION:

BRIEF SUMMARY
BACKGROUND OF THE INVENTION

1. Field of the Invention
The invention relates to a method for synthesising speech using concatenation and, in particular, synthesising voiceless consonants.
2. Discussion of the Background
In a known speech synthesis method, small sections of sound recorded from a human speaker are linked together, i.e. concatenated. The sections are diphones (i.e. sounds spanning two phonemes) or polyphones (i.e. sounds spanning a number of phonemes). The advantage of the known method is that the main part of the coarticulation (i.e. that part of the pronunciation of a phoneme that is influenced by surrounding phonemes) is located in the area around the phoneme boundary, which is included in the recorded sections, and is therefore reproduced in a natural, human-like manner in the synthesised speech. The known method also covers the generation of synthetic speech with arbitrary phoneme durations and optional fundamental tone curves, even in those cases where the fundamental tone is in the same register as the person who made the recording from which the speech is synthesised.
In accordance with the known speech synthesis method, a synthetic waveform is created by "out-windowing" suitably selected parts of the recorded polyphones with a Hanning-window and copying them into suitably selected places in the synthetic waveform. For voiced speech, i.e. voiced sounds, the Hanning-windows are placed so that the centre of each window is located at the excitation point of a glottal pulse, i.e. at the point in time where the vocal cords are closed.
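The out-windowing and copying described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `hann`, `out_window` and `overlap_add`, the window half-width and the choice to treat samples outside the recording as zero are all assumptions made for the example. The window centres would in practice be the excitation points, which the sketch simply takes as given sample indices.

```python
import math


def hann(n):
    """Symmetric Hann (Hanning) window of n samples."""
    if n == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def out_window(signal, centre, half_width):
    """Cut 2*half_width+1 samples around `centre` and weight them with a
    Hann window. Samples outside the signal are treated as zero
    (an assumption of this sketch)."""
    window = hann(2 * half_width + 1)
    segment = []
    for k, w in enumerate(window):
        i = centre - half_width + k
        x = signal[i] if 0 <= i < len(signal) else 0.0
        segment.append(x * w)
    return segment


def overlap_add(out, segment, centre):
    """Add a windowed segment into `out` so that its middle sample
    lands at index `centre`. Parts falling outside `out` are dropped."""
    half = len(segment) // 2
    for k, s in enumerate(segment):
        i = centre - half + k
        if 0 <= i < len(out):
            out[i] += s
```

Moving the centre at which a segment is added, relative to the centre at which it was cut, is what lets such a scheme change durations and fundamental tone curves.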
With unvoiced speech, for example voiceless consonants, there are no glottal pulses and hence no known way of placing the Hanning-windows for speech synthesis. In the known methods, this problem is generally overcome by using a fixed interval between the Hanning-windows. This approach gives rise to problems in the synthesis of phonemes of long duration, especially where the synthesised sound needs to be longer than the recorded sound. In such cases, the same "out-windowed" signal must be copied, in sequence, into a number of suitably selected places in the synthetic waveform. Listeners readily perceive the resulting periodicity, so the synthesised consonants are heard as sounds having a whistling character. If the Hanning-window is longer, a `chuff-chuff`-like sound is heard instead. The problem can be reduced by reversing the content of every second Hanning-window, i.e. playing it back in reverse. However, this does not totally eliminate the problem.
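The periodicity problem described above can be illustrated with a small sketch. It assumes a single noise-like segment, a fixed interval (`hop`) of half the window length, and hypothetical helper names (`stretch_by_repetition`, `repeats_with_period`); none of these values come from the patent. Repeating one out-windowed segment at a fixed interval yields a waveform that repeats exactly with that interval, and reversing every second copy only moves the repetition to twice the interval.

```python
import math
import random


def hann(n):
    """Symmetric Hann (Hanning) window of n samples."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def stretch_by_repetition(segment, hop, count, reverse_alternate=False):
    """Overlap-add `count` copies of one windowed segment, `hop` samples
    apart. Optionally play every second copy backwards."""
    out = [0.0] * (hop * (count - 1) + len(segment))
    for j in range(count):
        seg = segment[::-1] if (reverse_alternate and j % 2 == 1) else segment
        for k, s in enumerate(seg):
            out[j * hop + k] += s
    return out


def repeats_with_period(x, period, start, stop, tol=1e-12):
    """True if x[i] equals x[i + period] for every i in [start, stop)."""
    return all(abs(x[i] - x[i + period]) <= tol for i in range(start, stop))


# Illustrative values only: a 64-sample windowed noise burst, hop of 32.
rng = random.Random(0)
n, hop = 64, 32
segment = [rng.gauss(0.0, 1.0) * w for w in hann(n)]

plain = stretch_by_repetition(segment, hop, 8)
flipped = stretch_by_repetition(segment, hop, 8, reverse_alternate=True)
```

In this sketch `plain` repeats every `hop` samples once the copies fully overlap, while `flipped` no longer repeats at `hop` but still repeats at `2 * hop`, which is consistent with the reversal reducing the problem without removing it.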


SUMMARY OF THE INVENTION

It is an object of the present invention to provide a method for synthesising speech using concatenation and, in particular, for synthesising voiceless consonants, which overcomes the problems outlined above.
The invention provides a method for synthesising speech using concatenation and Hanning-windows, in which a synthetic waveform is formed by concatenation of suitably selected parts of recorded human speech, said selected parts being out-windowed with a Hanning-window and copied into suitably selected locations in the synthetic waveform, characterised in that said method is adapted to synthesise unvoiced consonants and includes the steps of palindromically copying suitably selected parts of a waveform of said recorded human speech to form a synthesized waveform for said unvoiced consonant using concatenation. The method may be used for diphone, or polyphone, synthesis.
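One possible reading of palindromic copying is sketched below; the text above does not spell out the procedure, so every detail here is an assumption. The sketch assumes that the recorded unvoiced sound is cut into frames at a fixed interval, that the frames are read forwards to the end of the recording and then backwards again, that the turning frames are not repeated, and that frames read on the backward pass are time-reversed. The names `palindromic_order` and `synthesise_unvoiced` and all parameters are hypothetical.

```python
import math


def hann(n):
    """Symmetric Hann (Hanning) window of n samples."""
    if n == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def palindromic_order(n_frames, n_needed):
    """Return `n_needed` (frame_index, backwards) pairs that walk the
    frames 0..n_frames-1 forwards, then backwards, and so on.
    `backwards` is True while the walk is moving towards frame 0."""
    if n_frames < 1:
        raise ValueError("need at least one frame")
    order = []
    i, step = 0, 1
    for _ in range(n_needed):
        order.append((i, step < 0))
        if n_frames > 1:
            if i + step < 0 or i + step >= n_frames:
                step = -step  # turn around at either end of the recording
            i += step
    return order


def synthesise_unvoiced(recording, frame_len, hop, n_out_frames):
    """Build a longer unvoiced waveform from a shorter recording by
    overlap-adding Hann-windowed frames taken in palindromic order.
    Frames read on the backward pass are time-reversed (an assumption)."""
    if len(recording) < frame_len:
        raise ValueError("recording shorter than one frame")
    n_frames = (len(recording) - frame_len) // hop + 1
    window = hann(frame_len)
    out = [0.0] * (hop * (n_out_frames - 1) + frame_len)
    order = palindromic_order(n_frames, n_out_frames)
    for j, (f, backwards) in enumerate(order):
        seg = [recording[f * hop + k] * window[k] for k in range(frame_len)]
        if backwards:
            seg.reverse()
        for k, s in enumerate(seg):
            out[j * hop + k] += s
    return out
```

With four frames and nine output frames, for example, the frames would be used in the order 0, 1, 2, 3, 2, 1, 0, 1, 2, so no single out-windowed segment is copied repeatedly at a fixed interval.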
The invention also provides a method for synthesising speech using concatenation and Hanning-windows, in which a synthetic waveform is formed by concatenation of suitably selected parts of recorded human speech, said selected parts being out-windowed with a Hanning-window and copied into suitably selected locations in the synthetic waveform, characterised in that said method is used for diphone synthesis and

REFERENCES:
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4833718 (1989-05-01), Sprague
patent: 5659664 (1997-08-01), Kaja
Hamon et al, International Conference on Acoustics, Speech and Signal Processing, "A Diphone Synthesis System Based on Time-Domain Prosodic Modifications of Speech", May 1989, pp. 238-241.
