Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2006-12-12
2006-12-12
Lerner, Martin (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S278000
Reexamination Certificate
active
07149686
ABSTRACT:
A system and method for eliminating synchronization errors using speech recognition. Using separate audio and visual speech recognition techniques, the inventive system and method identifies visemes, or visual cues which are indicative of articulatory type, in the video content, and identifies phones and their articulatory types in the audio content. Once the two recognition techniques have been applied, the outputs are compared to determine the relative alignment and, if not aligned, a synchronization algorithm is applied to time-adjust one or both of the audio and the visual streams in order to achieve synchronization.
REFERENCES:
patent: 5608839 (1997-03-01), Chen
patent: 5835667 (1998-11-01), Wactlar et al.
patent: 5844600 (1998-12-01), Kerr
patent: 6219640 (2001-04-01), Basu et al.
patent: 6317716 (2001-11-01), Braida et al.
patent: 6366885 (2002-04-01), Basu et al.
patent: 6505153 (2003-01-01), Van Thong et al.
patent: 6510279 (2003-01-01), Morishita
patent: 6697120 (2004-02-01), Haisma et al.
patent: 6839672 (2005-01-01), Beutnagel et al.
patent: 6862569 (2005-03-01), Basso et al.
G. David Forney, Jr. “The Viterbi Algorithm.” Proceedings of the IEEE, Mar. 1973.
Lalit R. Bahl, Member, IEEE; Frederick Jelinek, Fellow, IEEE; and Robert L. Mercer. “A Maximum Likelihood Approach to Continuous Speech Recognition.”IEEE Transactions on Pattern Analysis and Machine Intelligence. vol. PAMI-5, No. 2, Mar. 1983.
Andrew J. Viterbi. “Error Bounds for Convolutional Codes and as Asymptotically Optimum Decoding Algorithm.”IEEE Transactions on Information Theory, Apr. 1967.
Ashish Verma, Tanveer Faruquie, Chalapathy Neti, Sankar Basu and Andrew Senior. “Late Integration in Audio-Visual Continuous Speech Recognition.”
S. Basu, C. Neti, N. Rajput, A. Senior, L. Subramaniam, and A. Verma. “Audio-visual Large Vocabulary Continuous Speech Recognition in the Broadcast Domain.”IEEE Multimedia Signal Processing, Sep. 1999.
Cohen Paul S.
Dildine John R.
Gleason Edward J.
Dang Thu Ann
Dougherty Anne Vachon
Lerner Martin
LandOfFree
System and method for eliminating synchronization errors in... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for eliminating synchronization errors in..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for eliminating synchronization errors in... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3666483