Reexamination Certificate
2006-03-14
2006-03-14
Abebe, Daniel (Department: 2655)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S256000
Reexamination Certificate
active
07013276
ABSTRACT:
Speech recognizer confusion is predicted for utterances that can be represented by any combination of text form and audio file. The utterances are represented with an intermediate representation that directly reflects their acoustic characteristics. Text representations of the utterances can be used directly to predict confusability, without access to audio file examples of the utterances. In a first embodiment, two text utterances are represented as strings of phonemes, and the least cost of transforming one string of phonemes into the other serves as the confusability measure. In a second embodiment, two utterances are represented with an intermediate representation of sequences of acoustic events, based on phonetic capabilities of speakers and obtained from acoustic signals of the utterances, and the acoustic events are compared. Confusability of the utterances is then predicted according to the formula 2K/T, where K is the number of matched acoustic events and T is the total number of acoustic events.
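The two measures named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. The first function computes a least-cost string transformation in the style of the Wagner and Fischer string-to-string correction algorithm listed in the references, with uniform unit costs assumed (the abstract does not give the actual cost values). The second function evaluates 2K/T under two assumptions that the abstract does not confirm: matched events are counted as the longest common subsequence of the two event sequences, and T is the combined number of events in both sequences. All function names and the phoneme and event labels are hypothetical.

```python
from typing import Sequence


def min_edit_cost(source: Sequence[str], target: Sequence[str],
                  sub_cost: float = 1.0, ins_cost: float = 1.0,
                  del_cost: float = 1.0) -> float:
    """Least cost of transforming `source` into `target`.

    Dynamic programming over prefixes; the unit costs are an
    assumption, not values taken from the patent.
    """
    # prev[j] = cost of turning the empty prefix of source into target[:j]
    prev = [j * ins_cost for j in range(len(target) + 1)]
    for i, s in enumerate(source, start=1):
        curr = [i * del_cost]  # delete all of source[:i]
        for j, t in enumerate(target, start=1):
            curr.append(min(
                prev[j] + del_cost,                          # delete s
                curr[j - 1] + ins_cost,                      # insert t
                prev[j - 1] + (0.0 if s == t else sub_cost), # match or substitute
            ))
        prev = curr
    return prev[-1]


def count_matched_events(a: Sequence[str], b: Sequence[str]) -> int:
    """Number of matched events, assumed here to be the length of the
    longest common subsequence of the two event sequences."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def confusability(a: Sequence[str], b: Sequence[str]) -> float:
    """2K/T, with K matched events and T assumed to be len(a) + len(b)."""
    total = len(a) + len(b)
    if total == 0:
        return 0.0  # no events to compare
    return 2 * count_matched_events(a, b) / total


# Hypothetical phoneme strings for two similar-sounding words.
cost = min_edit_cost(["k", "ae", "t"], ["b", "ae", "t"])
# Hypothetical acoustic-event labels.
score = confusability(["stop", "vowel", "nasal"], ["stop", "vowel", "glide"])
```

Under these assumptions a lower transformation cost and a higher 2K/T value both indicate utterances that are more likely to be confused, and 2K/T stays between 0 and 1.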
REFERENCES:
patent: 4972485 (1990-11-01), Dautrich et al.
patent: 5097509 (1992-03-01), Lennig
patent: 5452397 (1995-09-01), Ittycheriah et al.
patent: 5638425 (1997-06-01), Meador, III et al.
patent: 5664058 (1997-09-01), Vysotsky
patent: 5737723 (1998-04-01), Riley et al.
patent: 5778344 (1998-07-01), Attwater et al.
patent: 5799276 (1998-08-01), Komissarchik et al.
patent: 5960393 (1999-09-01), Cohrs et al.
patent: 5987411 (1999-11-01), Petroni et al.
patent: 6014624 (2000-01-01), Raman
patent: 6049594 (2000-04-01), Furman et al.
patent: 6073099 (2000-06-01), Sabourin et al.
patent: 6122361 (2000-09-01), Gupta
patent: 6134527 (2000-10-01), Meunier
patent: 6185530 (2001-02-01), Ittycheriah et al.
patent: 6360197 (2002-03-01), Wu et al.
Stevens, Kenneth N., From Acoustic Cues To Segments, Features, and Words, Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, Oct. 16-20, 2000, pp. 1-8.
Stevens, K.N. (1992), Lexical access from features, MIT Speech Communication Group Working Papers, VIII, 119-144.
Stevens, K.N., Manuel, S.Y., Shattuck-Hufnagel, S., and Liu, S. (1992), Implementation of a model for lexical access based on features, in J.J. Ohala, T.M. Nearey, G.L. Derwing, M.M. Hodge, and G.E. Wiebe (Eds.), Proceedings of the 1992 International Conference on Spoken Language Processing, Edmonton, Canada: University of Alberta, pp. 499-502.
Bitar, N., and Espy-Wilson, C. (1995), A signal representation of speech based on phonetic features, Proceedings of IEEE Dual-Use Technology and Applications Conference, 310-315.
Ariel Salomon and Carol Espy-Wilson (1999), Automatic Detection of manner events based on temporal parameters, Proc. Eurospeech, Sep. '99, pp. 2797-2800.
Wagner, R.A. and Fischer, M.J. (1974), The string-to-string correction problem, Journal of the Association for Computing Machinery, 21, 168-173.
Cormen, Leiserson, and Rivest (1990), Introduction to Algorithms, Cambridge, MA: MIT Press, Chapter 34, pp. 853-885.
Greenberg, S. (2000), Understanding Spoken Language using Statistical and Computational Methods, Presented at Patterns of Speech Sounds in Unscripted Communication - Production, Perception, Phonology, Akademie Sandelmark, Germany, Oct. 8-11.
Greenberg, S., and S. Chang (2000), Linguistic Dissection of Switchboard-Corpus: Automatic Speech Recognition Systems, Presented at the ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, Sep. 18-20, 2000.
Liu, S. (1995), Landmark Detection for Distinctive Feature-based Speech Recognition, Ph.D. Thesis, Cambridge, MA: Massachusetts Institute of Technology.
Syrdal, Ann K. (1984), Aspects of an auditory representation of American English vowels, Speech Communication Group Working Papers, vol. IV, Research Laboratory of Electronics, Massachusetts Institute of Technology, pp. 27-41.
Nearey, T.M. and Assmann, P. (1986), Modeling the role of vowel inherent spectral change in vowel identification, Journal of the Acoustical Society of America 80, pp. 1297-1308.
Fell, H.J., L.J. Ferrier, C. Espy-Wilson, S.G. Worst, E.A. Craft, K. Chenausky, J. MacAuslan, and G. Hennessey (2000), Analysis of Infant Babbles by the Early Vocalization Analyzer, Presented at the American Speech-Language-Hearing Convention, Nov. 17, 2000.
Fell, H.J., J. MacAuslan, K. Chenausky, and L.J. Ferrier (1998), Automatic Babble Recognition for Early Detection of Speech Related Disorders, Assets '98, Proceedings of the Third International ACM SIGCAPH Conference on Assistive Technologies, Marina del Rey, CA.
Fell, H.J., L.J. Ferrier, Z. Mooraj, E. Benson, and D. Schneider (1996), EVA, an Early Vocalization Analyzer, An Empirical Validity Study of Computer Categorization, Assets '96, Proceedings of the Third International ACM SIGCAPH Conference on Assistive Technologies.
Hillenbrand, J.M., and M.J. Clark (2000), Effects of consonant environment on vowel formant patterns, J. Acoust. Soc. Am. 109(2), pp. 748-763.
Howitt, A.W. (1991), Application of the Wigner distribution to speech analysis, MIT Speech Communication Group Working Papers, VII, pp. 23-46.
International Computer Science Institute, Welcome to the NIST Scoring Toolkit Version 0.1, Sclite - score speech recognition system output (http://www.icsi.berkeley.edu/speech/docs/sctk-1.2/sclite.htm), Sclite Revision.txt, Index of /speech/docs/sctk-1.2 (program documentation for software tool known as “sclite” as part of NIST Scoring Toolkit (SCTK) software tools), Apr. 6, 1998.
Bickley Corine A.
Denenberg Lawrence A.
Abebe Daniel
Comverse Inc.
Staas & Halsey , LLP