Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2008-01-15
2008-01-15
Hudspeth, David (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
Reexamination Certificate
active
07319955
ABSTRACT:
An arrangement for yielding enhanced audio features towards the provision of enhanced audio-visual features for speech recognition. Input is provided in the form of noisy audio-visual features and noisy audio features related to the noisy audio-visual features.
REFERENCES:
patent: 4449189 (1984-05-01), Feix et al.
patent: 4757541 (1988-07-01), Beadles
patent: 5412738 (1995-05-01), Brunelli et al.
patent: 5621858 (1997-04-01), Stork et al.
patent: 6594629 (2003-07-01), Basu et al.
patent: 2002/0113687 (2002-08-01), Center et al.
patent: 2002/0116197 (2002-08-01), Erten
Deligne et al., “Audio-visual speech enhancement with AVCDCN”, Sensor Array and Multichannel Signal Processing Workshop Proceedings, Aug. 4-6, 2002 pp. 68-71.
Girin et al., “Audiovisual speech enhancement: new advances using multi-layer perceptrons”, IEEE Second Workshop on Multimedia Signal Processing, Dec. 7-9, 1998 pp. 77-82.
Girin et al., “Fusion if auditory and visual information for noisy speech enhancement: a preliminary study of vowel transitions”, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, May 12-15, 1998 pp. 1005-1008.
Deng et al., “High-Performance Robust Speech Recognition Using Stereo Training Data”, Proceedings of ICASSP 2001, May 2001.
Potamianos et al., “Hierarchical Discriminant Features for Audio-Visual LVCSR”, Proceedings of ICASSP 2001, May 2001.
Neti et al., “Audio-Visual Speech Recognition, Final Workshop Report”, Center for Language and Speech Processing, 2000.
Acero et al., “Environmental Robustness in Automatic Speech Recognition”, Proceedings of ICASSP'90, pp. 849-852, 1990.
Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition, PhD thesis, Dept. of Elec. and Comp. Engineering, CMU, Pittsburgh, PA 15213, Sep. 1990.
Girin et al., “Audio-Visual Enhancement of Speech in Noise”, Journal of the Accoustical Society of America, vol. 6, n. 109, pp. 3007-3020, 2001.
Goecke et al., “Noisy Audio Feature Enhancement Using Audio-Visual Speech Data”, Proceedings of ICASSP'02, 2002.
L. Rabiner et al., Fundamentals of Speech Recognition, Prentice Hall Signal Processing Series, Chapter 3, 1993.
Bahl et al., Performance of the IBM Large Vocabulary Continuous Speech Recognition System on the ARPA Wall Street Journal Task, Proc. of ICASSP 1995, vol. 1, pp. 41-45, 1995.
Deligne Sabine
Neti Chalapathy V.
Potamianos Gerasimos
Albertalli Brian L.
Ference & Associates LLC
Hudspeth David
International Business Machines - Corporation
LandOfFree
Audio-visual codebook dependent cepstral normalization does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio-visual codebook dependent cepstral normalization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio-visual codebook dependent cepstral normalization will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2791254