Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Patent
1995-07-07
2000-03-14
Isen, Forester W.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704255, G10L 502, G10L 400
Patent
active
060385332
ABSTRACT:
A system and method are described for determining a near-optimum subset of data, based on a selected model, from a large corpus of data. Sets of feature vectors corresponding to natural or other preselected divisions of the data corpus are mapped into matrices representative of such divisions. The invention operates to find a submatrix of full rank formed as a union of one or more of those division-based matrices. A greedy algorithm utilizing Gram-Schmidt orthonormalization operates on the division matrices to find a near optimum submatrix and in a time bound representing a substantial improvement over prior-art methods. An important application of the invention is the selection of a small number of sentences from a corpus of a very large number of such sentences from which the parameters of a duration model for speech synthesis can be estimated.
REFERENCES:
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5204905 (1993-04-01), Mitone
patent: 5230037 (1993-07-01), Giustiani et al.
patent: 5268990 (1993-12-01), Cohen et al.
patent: 5581655 (1996-12-01), Cohen et al.
Van Santen, J P H "Perceptual Experiments for Diagnostic Testing of Text to Speech Systems", Computer Speech and Language, vol. 7, No. 1., Jan. 1, 1993 pp. 49-100, XP000354661, Abstract paragraph 2.1.2.
Van Santen, J P H et al. "The Analysis of Contextual Effects on Segmental Duration"; Computer Speech and Language, vol. 4, No. 4., Oct. 1, 1990, pp. 359-390, XP000202888, Abstract, Paragraphs 3.1 and 3.2.
Macarron, A et al. "Generation of Duration Rules for Spanish Text to Speech Synthesizer", Eurospeech 91, 2nd European Conference on Speech Communication and Technology Proceedings, Genova, Italy, Sep. 24-26, 1991, Genova, Italy, Instituto Int. Comunicazioni, Italy, pp. 617-620, XP002041371, Abstract paragraph 5.
Olive, J.P. and Sproat, R.W., "Text-To-Speech Synthesis," AT&T Technical Journal, vol. 74, pp. 35-44, 1995.
VanSanten, "Assignment of Segmental Duration In Text-To-Speech Synthesis," Computer Speech and Language, vol. 8, pp. 95-128.
Olive, J.P., Greenwood, A., and Coleman, J. "Acoustics of American English Speech," Springer-Verlag, New York, 1993.
Roussas, E.G., A First Course In Mathematical Statistics, Addison-Wesley Publishing Company, Reading, MA, 1973.
Welsh, D.J.A. Matroid Theory, Academic Press, 1976.
Tarjan, R.E. Data Structures and Network Algorithms, CBMS-NSF Regional Conference Series in Applied Mathematics, society for Industrial and Applied Mathematics, Philadelphia, PA, 1993.
Kruskai, J.R. "On the Shorteset Spanning Subtree of a Graph and the Traveling Salesman Problem," Proceedings of the American Mathematical Society, vol. 7, pp. 53-57, 1956.
Nemhauser and Wolsey, Interger and Combinatorial Optimization, John wiley & Sons, 1988.
Greene, D.R. and Knuth, D.E. Mathematics of the Analysis of Algorithms, Birkhauser, Boston, second edition, 1982.
Garey, M.R,. and Johnson, D.S. Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman and Company, New York, 1979.
Lund and Yannakakis, "On the Hardness of Approximating Minimization Problems," (extended abstract), Proc. 25th ACM Symp. on theory of Computing, pp. 286-293, 1993.
Golub, G. H. and van Loan, C.F. Matrix Computations, Johns Hopkins Series in the Mathematical Sciences, Johns Hopkins University Press, Baltimore, second edition, 1989.
Barnett, S. Matrices, Methods, and Applications, Oxford Applied Mathematics and Computing Science Series, Clarendon Press, Oxford, 1990.
van Santen, J.P., "Assignment of segmental duration in text-to-speech synthesis", Computer Speech and Language (1994) 8, 95-128.
Sproat, et al., "Text-to-Speech Synthesis", AT&T Technical Journal (To appear).
Buchsbaum Adam Louis
VanSanten Jan Pieter
Edouard Patrick N.
Isen Forester W.
Lucent Technologies - Inc.
LandOfFree
System and method for selecting training text does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for selecting training text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for selecting training text will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-178736