Synthesizing phoneme string of predetermined duration by...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Synthesizing phoneme string of predetermined duration by... Synthesizing phoneme string of predetermined duration by...

: 1999-03-09
: 2003-04-08
: S{haeck over (m)}its, T{overscore (a)}livaldis Ivars (Department: 2641)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: Synthesis

: C704S267000, C704S278000
: Reexamination Certificate
: active
: 06546367
: ABSTRACT:

BACKGROUND OF THE INVENTION
The present invention relates to a method and an apparatus for speech synthesis utilizing a rule-based synthesis method, and a storage medium storing computer-readable programs for realizing the speech synthesizing method.
As a method of controlling a phoneme duration, a conventional rule-based speech synthesizing apparatus employs a control-rule method determined based on statistics related to a phoneme duration (Yoshinori SAGISAKA, Youichi TOUKURA, “Phoneme Duration Control for Rule-Based Speech Synthesis,” The Journal of the Institute of Electronics and Communication Engineers of Japan, vol. J67-A, No. 7 (1984) pp 629-636), or a method of employing Categorical Multiple Regression as a technique of multiple regression analysis (Tetsuya SAKAYORI, Shoichi SASAKI, Hiroo KITAGAWA, “Prosodies Control Using Categorical Multiple Regression for Rule-Based Synthesis,” “Report of the 1986 Autumn Meeting of the Acoustic Society of Japan,” 3-4-17 (1986-10)).
However, according to the above conventional technique, it is difficult to specify the speech production time of a phoneme string. For instance, in the control-rule method, it is difficult to determine a control rule that corresponds to a specified speech-production time. Moreover, if input data includes an exception in the control rule method, or if a satisfactory estimation value is not obtained in the method of Categorical Multiple Regression, it becomes difficult to obtain a phoneme duration that sounds natural.
In a case of controlling a phoneme duration by using control rules, it is necessary to weigh the statistics (average value, standard deviation and so on) while taking into consideration of the combination of preceding and succeeding phonemes, or it is necessary to set an expansion coefficient. There are various factors to be manipulated, e.g., a combination of phonemes depending on each case, parameters such as weighting and expansion coefficients and the like. Moreover, the operation method (control rules) must be determined by rule of thumb. Therefore, in a case where a speech-production time of a phoneme string is specified, the number of combinations of phonemes become extremely large. Furthermore, it is difficult to determine control rules applicable to any combination of phonemes in which a total phoneme duration is close to the specified speech-production time.
SUMMARY OF THE INVENTION
The present invention is made in consideration of the above situation, and has as its object to provide a speech synthesizing method and apparatus as well as a storage medium, which enables setting the phoneme duration for a phoneme string so as to achieve a specified speech-production time, and which can provide a natural phoneme duration regardless of the length of speech production time.
In order to attain the above object, the speech synthesizing apparatus according to an embodiment of the present invention has the following configuration. More specifically, the speech synthesizing apparatus for performing speech synthesis according to an inputted phoneme string comprises: storage means for storing statistical data related to a phoneme duration of each phoneme; determining means for determining speech production time of a phoneme string in a predetermined section; setting means for setting the phoneme duration corresponding to the speech-production time of each phoneme constructing the phoneme string, based on the statistical data of each phoneme obtained from the storage means; and generating means for generating a speech waveform by connecting phonemes using the phoneme duration.
Furthermore, the present invention provides a speech synthesizing method executed by the above speech synthesizing apparatus. Moreover, the present invention provides a storage medium storing control programs for having a computer realize the above speech synthesizing method.
Other features and advantages of the present invention will be apparent from the following description taken in conjunction with the accompanying drawings, in which like reference characters designate the same or similar parts throughout the figures thereof.

REFERENCES:
patent: 5682502 (1997-10-01), Ohtsuka et al.
patent: 6038533 (2000-03-01), Buchsbaum et al.
patent: 6064960 (2000-05-01), Bellegarda et al.
patent: 6101470 (2000-08-01), Eide et al.
patent: WO 96/42079 (1996-12-01), None
Keikichi Hirose, Mayumi Sakata, and Hiromichi Kawanami “Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features,” Proc. ICSLP 96, vol. 1, p. 378-381, Oct. 1996.*
Gerard Bailly “Integration of Rhythmic and Syntactic Constraints in a Model of Generation of French Prosody,” Speech Communication, vol. 8, No. 2, p. 137-146, Jun. 1989.*
“Phoneme Control Using the Method of Categorial Multiple Regression for Synthesis by Rule,” Sakayori, et al., Report of the 1986 Autumn Meeting of the Acoustic Society of Japan, 3-4-17, Oct. 1986.
Phoneme Duration Control for Speech Synthesis by Rule, Yoshinori Sagisaka, et al., The Journal of the Institute of Electronics and Communication Engineers of Japan, vol. J67-A, No. 7, 1984, pp. 629-636.
Mobius, et al. “Modeling Segmental Duration In German Text-to-Speech Synthesis”, Proceedings ICSLP 96, 4thInternat'l Conf. pp. 2395-2398, vol. 4, Oct. 3-6, 1996.
Campbell, et al., “Duration Pitch And Diphones In the CSTR TTS System,” Proceedings of Internat'l Conf. on Spoken Language Processing, Nov. 18, 1990, vol. 2, pp. 825-828.
The Transaction of the Institute of Electronics and Comm. Eng. Of Japan, vol. J67-A, No. 7, Jul. 1984, pp. 629-636, “Phoneme Duration Control for Speech Synthesis By Rule,” Sagisaka, et al.

Affiliated with

Otsuka Mitsuru

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Canon Kabushiki Kaisha

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Fitzpatrick ,Cella, Harper & Scinto

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

S{haeck over (m)}its T{overscore (a)}livaldis Ivars

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Synthesizing phoneme string of predetermined duration by... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Synthesizing phoneme string of predetermined duration by..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Synthesizing phoneme string of predetermined duration by... will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3110885

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure