Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2011-08-02
2011-08-02
Vo, Huyen X. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S244000, C704S255000
Reexamination Certificate
active
07991615
ABSTRACT:
Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of acoustics and graphonemes (acoustic data, phonemes sequences, grapheme sequences and an alignment between phoneme sequences and grapheme sequences) is described, as is retraining by maximum likelihood training and discriminative training in adapting graphoneme model parameters using acoustic data. Also described is the unsupervised collection of grapheme labels for received acoustic data, thereby automatically obtaining a substantial number of actual samples that may be used in retraining. Speech input that does not meet a confidence threshold may be filtered out so as to not be used by the retrained model.
REFERENCES:
patent: 6078885 (2000-06-01), Beutnagel
patent: 6094633 (2000-07-01), Gaved et al.
patent: 6230131 (2001-05-01), Kuhn et al.
patent: 7107216 (2006-09-01), Hain
patent: 7216079 (2007-05-01), Barnard et al.
patent: 7266495 (2007-09-01), Beaufays et al.
patent: 7280964 (2007-10-01), Wilson et al.
patent: 7406417 (2008-07-01), Hain
patent: 2005/0203739 (2005-09-01), Hwang et al.
patent: 2006/0031069 (2006-02-01), Huang et al.
patent: 2006/0064177 (2006-03-01), Tian et al.
patent: 2006/0215821 (2006-09-01), Rokusek et al.
patent: 2006/0265220 (2006-11-01), Massimino
International Search Report and Written Opinion for PCT Application No. PCT/US2008/083249, mailed on Apr. 28, 2009, 10 pages.
Bisani et al., “Investigations on Joint-Multigram Models for Grapheme-to-Phoneme Conversion”, In: 7th International Conference on Spoken Language Processing, Sep. 2002, pp. 105-108.
Chen, Stanley F., “Conditional and Joint Models for Grapheme-to-Phoneme Conversion”, In: Proceedings of Eurospeech 2003, Sep. 2003, pp. 2033-2036.
Bisani, “Multigram-based Grapheme-to-Phoneme Conversion for LVCSR”, pp. 1-4.
Bellegarda Jerome R., “A Novel Approach to Unsupervised Grapheme- to-Phoneme Conversion”, pp. 1-4.
Decadt, et al., “Optimizing Phoneme-to-Grapheme Conversion for Out-of-Vocabulary Words in Speech recognition”, Date: 2001, pp. 1-9.
Acero Alejandro
Gunawardana Asela J. R.
Li Xiao
Microsoft Corporation
Vo Huyen X.
LandOfFree
Grapheme-to-phoneme conversion using acoustic data does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Grapheme-to-phoneme conversion using acoustic data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Grapheme-to-phoneme conversion using acoustic data will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2689257