Methods and apparatus for rapid acoustic unit selection from...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S266000

Reexamination Certificate

active

08086456

ABSTRACT:
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. A method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs. By constructing a concatenation cost database in this fashion, the processing power required at run-time is greatly reduced with negligible effect on speech quality.

REFERENCES:
patent: 5740320 (1998-04-01), Itoh
patent: 5751907 (1998-05-01), Moebius et al.
patent: 5870706 (1999-02-01), Alshawi
patent: 5878393 (1999-03-01), Hata et al.
patent: 5913193 (1999-06-01), Huang et al.
patent: 5970460 (1999-10-01), Bunce et al.
patent: 6006181 (1999-12-01), Buhrke et al.
patent: 6101470 (2000-08-01), Eide et al.
patent: 6119086 (2000-09-01), Ittycheriah et al.
patent: 6125346 (2000-09-01), Nishimura et al.
patent: 6144939 (2000-11-01), Pearson et al.
patent: 6173263 (2001-01-01), Conkie
patent: 6202049 (2001-03-01), Kibre et al.
patent: 6233544 (2001-05-01), Alshawi
patent: 6266637 (2001-07-01), Donovan et al.
patent: 6266638 (2001-07-01), Stylianou
patent: 6366883 (2002-04-01), Campbell et al.
patent: 6370522 (2002-04-01), Agarwal et al.
patent: 6385580 (2002-05-01), Lyberg et al.
patent: 6505158 (2003-01-01), Conkie
patent: 6665641 (2003-12-01), Coorman et al.
patent: 6684187 (2004-01-01), Conkie
patent: 6697780 (2004-02-01), Beutnagel et al.
patent: 6701295 (2004-03-01), Beutnagel et al.
patent: 6950798 (2005-09-01), Beutnagel et al.
patent: 6961704 (2005-11-01), Phillips
patent: 6988069 (2006-01-01), Phillips
patent: 7013278 (2006-03-01), Conkie
patent: 7027568 (2006-04-01), Simpson et al.
patent: 7047194 (2006-05-01), Buskies
patent: 7082396 (2006-07-01), Beutnagel et al.
patent: 7124083 (2006-10-01), Conkie
patent: 7127396 (2006-10-01), Chu et al.
patent: 7233901 (2007-06-01), Conkie
patent: 7266497 (2007-09-01), Conkie et al.
patent: 7369994 (2008-05-01), Beutnagel et al.
patent: 7460997 (2008-12-01), Conkie
patent: 7565291 (2009-07-01), Conkie
patent: 7567896 (2009-07-01), Coorman et al.
patent: 7587320 (2009-09-01), Conkie et al.
patent: 7630896 (2009-12-01), Tamura et al.
patent: 7761299 (2010-07-01), Beutnagel et al.
patent: 2003/0115049 (2003-06-01), Beutnagel et al.
patent: 2004/0093213 (2004-05-01), Conkie
patent: 2004/0153324 (2004-08-01), Phillips
patent: 2005/0137870 (2005-06-01), Mizutani et al.
patent: 2005/0182629 (2005-08-01), Coorman et al.
patent: 2008/0077407 (2008-03-01), Beutnagel et al.
Chu et al., “Selecting Non-Uniform Units from a Very Large Corpus for Concatenative Speech Synthesizer,” 2001 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, May 2001, pp. 785-788.
Lee et al., “A Very Low Bit Rate Speech Coder Based on a Recognition/Synthesis Paradigm,” IEEE Transactions on Speech and Audio Processing, vol. 9, No. 5, Jul. 2001, pp. 482-491.
Veldhuis et al., “On the Computation of the Kullback-Leibler Measure of Spectral Distances,” IEEE Transactions on Speech and Audio Processing, vol. 11, No. 1, Jan. 2003, pp. 100-103.
Robert Endre Trajan and Andrew Chi-Chih Yao, “Storing a Sparse Table”, Communication of the ACM, vol. 22:11, pp. 606-611, Nov. 1979.
Y. Stylianou (1998) “Concatenative Speech Synthesis using a Harmonic plus Noise Model”, Workshop on Speech Synthesis, Jenolan Caves, NSW, Australia, Nov. 1998.
Hunt et al., “Unit Selection in a Concatenative Speech Synthesis using a Large Speech Database,” 1996 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, May 1996, pp. 373-376.
Beutnagel et al., “Rapid Unit Selection from a Large Speech Corpus for Concatenative Speech Synthesis”, AT&T Labs Research, Florham Park, New Jersey, 1999.
Webopedia, definition of “hashing”, http://www.webopedia.com/TERM/H/hashing.html. 1 page, Jan. 23, 2003.
TechTarget, definition of “hashing”, http://searchdatabase.techtarget.com/sDefinition/O,,sid13—gci212230,00.html, 2 pages. Jan. 23, 2003.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Methods and apparatus for rapid acoustic unit selection from... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Methods and apparatus for rapid acoustic unit selection from..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods and apparatus for rapid acoustic unit selection from... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4310891

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.