Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Reexamination Certificate
2001-03-28
2004-08-17
McFadden, Susan (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
C704S267000, C704S278000
Reexamination Certificate
active
06778960
ABSTRACT:
FIELD OF THE INVENTION
The present invention relates to a speech information processing method and apparatus for setting the duration of a phoneme upon speech synthesis, and a computer-readable storage medium holding a program for execution of a speech information processing method.
BACKGROUND OF THE INVENTION
Recently, a speech synthesis apparatus has been developed so as to convert an arbitrary character string into a phonological series and convert the phonological series into synthesized speech in accordance with a predetermined speech synthesis by rule.
However, the synthesized speech outputted from the conventional speech synthesis apparatus sounds unnatural and mechanical in comparison with natural speech sounded by human being.
For example, in a phonological series “o, X, s, e, i” of a character series “onsei”, the accuracy of a rule for controlling the duration of generating each phoneme is considered as one of the factors of the awkward-sounding result. If the accuracy is low, as appropriate duration cannot be assigned to each phoneme, the synthesized speech becomes unnatural and mechanical.
SUMMARY OF THE INVENTION
The present invention has been made in consideration of the above prior art, and has as its object to provide a speech information processing method and apparatus for setting the duration of phonological series with high accuracy and setting natural phonological duration in accordance with phonemic/linguistic environment.
To attain the foregoing objects, the present invention provides a speech information processing apparatus comprising: means for obtaining a duration of a predetermined unit of phonological series based on a duration model for an entire segment; means for obtaining a duration of each of phonemes constructing the phonological series based on a duration model for a partial segment; setting means for setting a duration of each of the phonemes based on the duration of the phonological series and the duration of each of the phonemes; and speech synthesis means for synthesizing speech based on the duration of each of the phonemes set by the setting means.
Further, the present invention provides a speech information processing method comprising: a step of obtaining a duration of a predetermined unit of phonological series based on a duration model for an entire segment; a step of obtaining a duration of each of phonemes constructing the phonological series based on a duration model for a partial segment; a setting step of setting a duration of each of the phonemes based on the duration of the phonological series and the duration of each of the phonemes; and a speech synthesis step of synthesizing speech based on the duration of each of the phonemes set at the setting step.
Other features and advantages of the present invention will be apparent from the following description taken in conjunction with the accompanying drawings, in which like reference characters designate the same name or similar parts throughout the figures thereof.
REFERENCES:
patent: 5633984 (1997-05-01), Aso et al.
patent: 5745650 (1998-04-01), Otsuka et al.
patent: 5745651 (1998-04-01), Otsuka et al.
patent: 5845047 (1998-12-01), Fukada et al.
patent: 6546367 (2003-04-01), Otsuka
patent: 0 942 410 (1999-09-01), None
patent: 11-259095 (1999-09-01), None
LandOfFree
Speech information processing method and apparatus and... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech information processing method and apparatus and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech information processing method and apparatus and... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3348527