Reexamination Certificate
2004-09-28
2009-12-22
Abebe, Daniel D (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
C704S258000, C704S265000, C345S473000, C345S474000
active
07636662
ABSTRACT:
A system and method are provided for synthesizing audio-visual content in a video image processor. A content synthesis application processor extracts audio features and video features from audio-visual input signals that represent a speaker who is speaking. The processor uses the extracted visual features to create a computer-generated animated version of the speaker's face. The processor synchronizes the facial movements of the animated face with a plurality of audio logical units, such as phonemes, that represent the speaker's speech. In this manner, the processor synthesizes an audio-visual representation of the speaker's face that is properly synchronized with the speaker's speech.
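The synchronization step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the processor aligns facial movements with phonemes, so everything below is an assumption made for illustration: the phoneme-to-mouth-shape table, the segment format (phoneme, start time, end time in seconds), and the function name `visemes_per_frame` are all hypothetical and are not taken from the patent.

```python
from typing import List, Tuple

# Hypothetical phoneme-to-mouth-shape (viseme) table; illustrative only,
# not taken from the patent.
PHONEME_TO_VISEME = {
    "p": "lips_closed", "b": "lips_closed", "m": "lips_closed",
    "f": "lip_to_teeth", "v": "lip_to_teeth",
    "aa": "jaw_open", "iy": "lips_spread", "uw": "lips_rounded",
    "sil": "neutral",
}


def visemes_per_frame(
    segments: List[Tuple[str, float, float]],
    fps: float,
    default: str = "neutral",
) -> List[str]:
    """Assign one mouth shape to each video frame from timed phoneme segments.

    segments: (phoneme, start_seconds, end_seconds) tuples (assumed format).
    fps: video frame rate in frames per second.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    if not segments:
        return []

    # The clip length is taken from the latest segment end time.
    duration = max(end for _, _, end in segments)
    n_frames = int(round(duration * fps))
    frames = [default] * n_frames

    for phoneme, start, end in segments:
        # Unknown phonemes fall back to the default mouth shape.
        viseme = PHONEME_TO_VISEME.get(phoneme, default)
        first = max(int(round(start * fps)), 0)
        last = min(int(round(end * fps)), n_frames)
        for i in range(first, last):
            frames[i] = viseme
    return frames


if __name__ == "__main__":
    # "ma" followed by silence, rendered at 10 frames per second.
    demo = [("m", 0.0, 0.1), ("aa", 0.1, 0.3), ("sil", 0.3, 0.4)]
    print(visemes_per_frame(demo, fps=10))
```

In this sketch each frame simply takes the mouth shape of the phoneme that is active at that time. A real system would more plausibly learn the audio-to-visual mapping from the extracted features and smooth the transitions between shapes, which this example does not attempt.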
REFERENCES:
patent: 6052132 (2000-04-01), Christian et al.
patent: 6366885 (2002-04-01), Basu et al.
patent: 6449595 (2002-09-01), Arslan et al.
patent: 6539354 (2003-03-01), Sutton et al.
patent: 6661418 (2003-12-01), McMillan et al.
patent: 6735566 (2004-05-01), Brand
patent: 6772122 (2004-08-01), Jowitt et al.
patent: 6839672 (2005-01-01), Beutnagel et al.
patent: 7123262 (2006-10-01), Francini et al.
patent: 7149686 (2006-12-01), Cohen et al.
patent: 7168953 (2007-01-01), Poggio et al.
patent: 2002/0008716 (2002-01-01), Colburn et al.
patent: 2003/0149659 (2003-08-01), Danaher et al.
patent: 2004/0021683 (2004-02-01), Huang et al.
patent: 2005/0057570 (2005-03-01), Cosatto et al.
patent: 2006/0204060 (2006-09-01), Huang et al.
patent: WO 02/05114 (2002-01-01), None
W.R. Rabiner et al, “Object Tracking Using Motion-Adaptive Modeling of Scene Content”, Proceedings of Globecom, vol. 2, pp. 877-881, 1996.
G. Hager et al, “The XVision System: A General Purpose Substrate for Portable Real-Time Vision Applications”, Computer Vision and Understanding, vol. 69, No. 1, pp. 23-37, 1997.
D. Li et al, “Classification of General Audio Data for Content-Based Retrieval”, Pattern Recognition Letters, vol. 22, No. 5, pp. 533-544, 2001.
L.R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition”, Proceedings of the IEEE, vol. 77, pp. 257-285, 1989.
M. Brand, “Voice Puppetry”, Computer Graphics Proceedings, ACM SIGGRAPH, pp. 21-28, Aug. 1999.
D. Li et al, “Content Retrieval Based on Semantic Association”, filed Nov. 15, 2002.
S. Curinga, “Lip Movements Synthesis Using Time-Delay”, Proceedings of the European Signal Processing Conference, 1996.
T. Masuko et al, “Text-to-Visual Speech Synthesis Based on Parameter Generation From HMM”, IEEE 1998, pp. 3745-3748.
T. Chen, “Audio-Visual Integration in Multimodal Communication”, Proceedings of the IEEE, vol. 86, No. 5, May 1998.
Dimitrova Nevenka
Li Dongge
Miller Andrew
Abebe Daniel D
Koninklijke Philips Electronics, N.V.
System and method for audio-visual content synthesis