Reexamination Certificate
2010-07-20
2011-12-27
Lerner, Martin (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
C704S266000
active
08086456
ABSTRACT:
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. A method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs. By constructing a concatenation cost database in this fashion, the processing power required at run-time is greatly reduced with negligible effect on speech quality.
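The caching idea in the abstract can be illustrated with a minimal sketch. It assumes acoustic units are identified by integers and that a batch of previously synthesized unit sequences is already available. The names `build_concat_cost_cache`, `lookup_cost`, and `toy_cost`, and the default cost used for pairs missing from the cache, are hypothetical and are not taken from the patent.

```python
from typing import Callable, Dict, Iterable, Sequence, Tuple

Pair = Tuple[int, int]


def build_concat_cost_cache(
    sequences: Iterable[Sequence[int]],
    concat_cost: Callable[[int, int], float],
) -> Dict[Pair, float]:
    """Cache costs only for unit pairs that actually occurred in synthesis."""
    cache: Dict[Pair, float] = {}
    for seq in sequences:
        # Walk each adjacent (left, right) pair in the selected unit sequence.
        for left, right in zip(seq, seq[1:]):
            if (left, right) not in cache:
                # The expensive computation runs once per distinct pair.
                cache[(left, right)] = concat_cost(left, right)
    return cache


def lookup_cost(cache: Dict[Pair, float], left: int, right: int,
                default_cost: float) -> float:
    """Run-time lookup; unseen pairs get an assumed default cost."""
    return cache.get((left, right), default_cost)


def toy_cost(left: int, right: int) -> float:
    """Stand-in for a real acoustic mismatch measure (illustration only)."""
    return float(abs(left - right))


# Two hypothetical unit sequences produced while synthesizing training text.
cache = build_concat_cost_cache([[1, 2, 3], [2, 3, 7]], toy_cost)
seen = lookup_cost(cache, 3, 7, default_cost=100.0)    # pair was observed
unseen = lookup_cost(cache, 1, 7, default_cost=100.0)  # pair never observed
```

Only the three distinct pairs seen in the sample sequences are stored, rather than every possible pair of units, which is the source of the space saving the abstract describes.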
REFERENCES:
patent: 5740320 (1998-04-01), Itoh
patent: 5751907 (1998-05-01), Moebius et al.
patent: 5870706 (1999-02-01), Alshawi
patent: 5878393 (1999-03-01), Hata et al.
patent: 5913193 (1999-06-01), Huang et al.
patent: 5970460 (1999-10-01), Bunce et al.
patent: 6006181 (1999-12-01), Buhrke et al.
patent: 6101470 (2000-08-01), Eide et al.
patent: 6119086 (2000-09-01), Ittycheriah et al.
patent: 6125346 (2000-09-01), Nishimura et al.
patent: 6144939 (2000-11-01), Pearson et al.
patent: 6173263 (2001-01-01), Conkie
patent: 6202049 (2001-03-01), Kibre et al.
patent: 6233544 (2001-05-01), Alshawi
patent: 6266637 (2001-07-01), Donovan et al.
patent: 6266638 (2001-07-01), Stylianou
patent: 6366883 (2002-04-01), Campbell et al.
patent: 6370522 (2002-04-01), Agarwal et al.
patent: 6385580 (2002-05-01), Lyberg et al.
patent: 6505158 (2003-01-01), Conkie
patent: 6665641 (2003-12-01), Coorman et al.
patent: 6684187 (2004-01-01), Conkie
patent: 6697780 (2004-02-01), Beutnagel et al.
patent: 6701295 (2004-03-01), Beutnagel et al.
patent: 6950798 (2005-09-01), Beutnagel et al.
patent: 6961704 (2005-11-01), Phillips
patent: 6988069 (2006-01-01), Phillips
patent: 7013278 (2006-03-01), Conkie
patent: 7027568 (2006-04-01), Simpson et al.
patent: 7047194 (2006-05-01), Buskies
patent: 7082396 (2006-07-01), Beutnagel et al.
patent: 7124083 (2006-10-01), Conkie
patent: 7127396 (2006-10-01), Chu et al.
patent: 7233901 (2007-06-01), Conkie
patent: 7266497 (2007-09-01), Conkie et al.
patent: 7369994 (2008-05-01), Beutnagel et al.
patent: 7460997 (2008-12-01), Conkie
patent: 7565291 (2009-07-01), Conkie
patent: 7567896 (2009-07-01), Coorman et al.
patent: 7587320 (2009-09-01), Conkie et al.
patent: 7630896 (2009-12-01), Tamura et al.
patent: 7761299 (2010-07-01), Beutnagel et al.
patent: 2003/0115049 (2003-06-01), Beutnagel et al.
patent: 2004/0093213 (2004-05-01), Conkie
patent: 2004/0153324 (2004-08-01), Phillips
patent: 2005/0137870 (2005-06-01), Mizutani et al.
patent: 2005/0182629 (2005-08-01), Coorman et al.
patent: 2008/0077407 (2008-03-01), Beutnagel et al.
Chu et al., “Selecting Non-Uniform Units from a Very Large Corpus for Concatenative Speech Synthesizer,” 2001 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, May 2001, pp. 785-788.
Lee et al., “A Very Low Bit Rate Speech Coder Based on a Recognition/Synthesis Paradigm,” IEEE Transactions on Speech and Audio Processing, vol. 9, No. 5, Jul. 2001, pp. 482-491.
Veldhuis et al., “On the Computation of the Kullback-Leibler Measure of Spectral Distances,” IEEE Transactions on Speech and Audio Processing, vol. 11, No. 1, Jan. 2003, pp. 100-103.
Robert Endre Tarjan and Andrew Chi-Chih Yao, “Storing a Sparse Table”, Communications of the ACM, vol. 22:11, pp. 606-611, Nov. 1979.
Y. Stylianou (1998) “Concatenative Speech Synthesis using a Harmonic plus Noise Model”, Workshop on Speech Synthesis, Jenolan Caves, NSW, Australia, Nov. 1998.
Hunt et al., “Unit Selection in a Concatenative Speech Synthesis using a Large Speech Database,” 1996 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, May 1996, pp. 373-376.
Beutnagel et al., “Rapid Unit Selection from a Large Speech Corpus for Concatenative Speech Synthesis”, AT&T Labs Research, Florham Park, New Jersey, 1999.
Webopedia, definition of “hashing”, http://www.webopedia.com/TERM/H/hashing.html. 1 page, Jan. 23, 2003.
TechTarget, definition of “hashing”, http://searchdatabase.techtarget.com/sDefinition/O,,sid13_gci212230,00.html, 2 pages. Jan. 23, 2003.
Beutnagel Mark Charles
Mohri Mehryar
Riley Michael Dennis
AT&T Intellectual Property II L.P.
Lerner Martin
Methods and apparatus for rapid acoustic unit selection from...