System and method for selecting training text

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704255, G10L 502, G10L 400

Patent

active

060385332

ABSTRACT:
A system and method are described for determining a near-optimum subset of data, based on a selected model, from a large corpus of data. Sets of feature vectors corresponding to natural or other preselected divisions of the data corpus are mapped into matrices representative of such divisions. The invention operates to find a submatrix of full rank formed as a union of one or more of those division-based matrices. A greedy algorithm utilizing Gram-Schmidt orthonormalization operates on the division matrices to find a near optimum submatrix and in a time bound representing a substantial improvement over prior-art methods. An important application of the invention is the selection of a small number of sentences from a corpus of a very large number of such sentences from which the parameters of a duration model for speech synthesis can be estimated.

REFERENCES:
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5204905 (1993-04-01), Mitone
patent: 5230037 (1993-07-01), Giustiani et al.
patent: 5268990 (1993-12-01), Cohen et al.
patent: 5581655 (1996-12-01), Cohen et al.
Van Santen, J P H "Perceptual Experiments for Diagnostic Testing of Text to Speech Systems", Computer Speech and Language, vol. 7, No. 1., Jan. 1, 1993 pp. 49-100, XP000354661, Abstract paragraph 2.1.2.
Van Santen, J P H et al. "The Analysis of Contextual Effects on Segmental Duration"; Computer Speech and Language, vol. 4, No. 4., Oct. 1, 1990, pp. 359-390, XP000202888, Abstract, Paragraphs 3.1 and 3.2.
Macarron, A et al. "Generation of Duration Rules for Spanish Text to Speech Synthesizer", Eurospeech 91, 2nd European Conference on Speech Communication and Technology Proceedings, Genova, Italy, Sep. 24-26, 1991, Genova, Italy, Instituto Int. Comunicazioni, Italy, pp. 617-620, XP002041371, Abstract paragraph 5.
Olive, J.P. and Sproat, R.W., "Text-To-Speech Synthesis," AT&T Technical Journal, vol. 74, pp. 35-44, 1995.
VanSanten, "Assignment of Segmental Duration In Text-To-Speech Synthesis," Computer Speech and Language, vol. 8, pp. 95-128.
Olive, J.P., Greenwood, A., and Coleman, J. "Acoustics of American English Speech," Springer-Verlag, New York, 1993.
Roussas, E.G., A First Course In Mathematical Statistics, Addison-Wesley Publishing Company, Reading, MA, 1973.
Welsh, D.J.A. Matroid Theory, Academic Press, 1976.
Tarjan, R.E. Data Structures and Network Algorithms, CBMS-NSF Regional Conference Series in Applied Mathematics, society for Industrial and Applied Mathematics, Philadelphia, PA, 1993.
Kruskai, J.R. "On the Shorteset Spanning Subtree of a Graph and the Traveling Salesman Problem," Proceedings of the American Mathematical Society, vol. 7, pp. 53-57, 1956.
Nemhauser and Wolsey, Interger and Combinatorial Optimization, John wiley & Sons, 1988.
Greene, D.R. and Knuth, D.E. Mathematics of the Analysis of Algorithms, Birkhauser, Boston, second edition, 1982.
Garey, M.R,. and Johnson, D.S. Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman and Company, New York, 1979.
Lund and Yannakakis, "On the Hardness of Approximating Minimization Problems," (extended abstract), Proc. 25th ACM Symp. on theory of Computing, pp. 286-293, 1993.
Golub, G. H. and van Loan, C.F. Matrix Computations, Johns Hopkins Series in the Mathematical Sciences, Johns Hopkins University Press, Baltimore, second edition, 1989.
Barnett, S. Matrices, Methods, and Applications, Oxford Applied Mathematics and Computing Science Series, Clarendon Press, Oxford, 1990.
van Santen, J.P., "Assignment of segmental duration in text-to-speech synthesis", Computer Speech and Language (1994) 8, 95-128.
Sproat, et al., "Text-to-Speech Synthesis", AT&T Technical Journal (To appear).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for selecting training text does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for selecting training text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for selecting training text will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-178736

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.