System and method for audio-visual content synthesis

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Reexamination Certificate

Details

C704S258000, C704S265000, C345S473000, C345S474000

Reexamination Certificate

active

07636662

ABSTRACT:
A system and method are provided for synthesizing audio-visual content in a video image processor. A content synthesis application processor extracts audio features and video features from audio-visual input signals that represent a speaker who is speaking. The processor uses the extracted visual features to create a computer-generated animated version of the speaker's face. The processor synchronizes the facial movements of the animated face with a plurality of audio logical units, such as phonemes, that represent the speaker's speech. In this manner the processor synthesizes an audio-visual representation of the speaker's face that is properly synchronized with the speaker's speech.
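
The synchronization step described in the abstract can be illustrated with a minimal sketch. It assumes the phonemes have already been extracted with start and end times, and it picks one mouth shape (viseme) per video frame by direct table lookup. The table `PHONEME_TO_VISEME`, the `TimedPhoneme` type, and the function `visemes_per_frame` are hypothetical names introduced for illustration; the abstract does not specify a mapping or a data layout, and a plain lookup is a simplification standing in for whatever model the patent actually uses.

```python
from dataclasses import dataclass

# Hypothetical phoneme-to-viseme table (an assumption, not taken from the patent).
PHONEME_TO_VISEME = {
    "p": "closed", "b": "closed", "m": "closed",
    "f": "lip_teeth", "v": "lip_teeth",
    "aa": "open", "ae": "open",
    "uw": "rounded", "ow": "rounded",
    "sil": "rest",
}


@dataclass
class TimedPhoneme:
    phoneme: str
    start: float  # seconds, inclusive
    end: float    # seconds, exclusive


def visemes_per_frame(phonemes, fps=25.0):
    """Return one viseme label per video frame.

    Each frame takes the viseme of the phoneme whose time interval
    contains the frame's timestamp; unknown phonemes and gaps map to "rest".
    """
    if not phonemes:
        return []
    duration = max(p.end for p in phonemes)
    n_frames = int(round(duration * fps))
    frames = []
    for i in range(n_frames):
        t = i / fps
        label = "rest"
        for p in phonemes:
            if p.start <= t < p.end:
                label = PHONEME_TO_VISEME.get(p.phoneme, "rest")
                break
        frames.append(label)
    return frames
```

Calling `visemes_per_frame` with a list of `TimedPhoneme` items and the video frame rate yields a label sequence that an animation layer could use to pose the face frame by frame, keeping the mouth movements aligned with the audio timeline.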

REFERENCES:
patent: 6052132 (2000-04-01), Christian et al.
patent: 6366885 (2002-04-01), Basu et al.
patent: 6449595 (2002-09-01), Arslan et al.
patent: 6539354 (2003-03-01), Sutton et al.
patent: 6661418 (2003-12-01), McMillan et al.
patent: 6735566 (2004-05-01), Brand
patent: 6772122 (2004-08-01), Jowitt et al.
patent: 6839672 (2005-01-01), Beutnagel et al.
patent: 7123262 (2006-10-01), Francini et al.
patent: 7149686 (2006-12-01), Cohen et al.
patent: 7168953 (2007-01-01), Poggio et al.
patent: 2002/0008716 (2002-01-01), Colburn et al.
patent: 2003/0149659 (2003-08-01), Danaher et al.
patent: 2004/0021683 (2004-02-01), Huang et al.
patent: 2005/0057570 (2005-03-01), Cosatto et al.
patent: 2006/0204060 (2006-09-01), Huang et al.
patent: WO 02/05114 (2002-01-01), None
W.R. Rabiner et al, “Object Tracking Using Motion-Adaptive Modeling of Scene Content”, Proceedings of Globecom, vol. 2, pp. 877-881, 1996.
G. Hager et al, “The XVision System: A General Purpose Substrate for Portable Real-Time Vision Applications”, Computer Vision and Understanding, vol. 69, No. 1, pp. 23-37, 1997.
D. Li et al, “Classification of General Audio Data for Content-Based Retrieval”, Pattern Recognition Letters, vol. 22, No. 5, pp. 533-544, 2001.
L.R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition”, Proceedings of the IEEE, vol. 77, pp. 257-285, 1989.
M. Brand, “Voice Puppetry”, Computer Graphics Proceedings, ACM SIGGRAPH, pp. 21-28, Aug. 1999.
D. Li et al, “Content Retrieval Based on Semantic Association”, filed Nov. 15, 2002.
S. Curinga, “Lip Movements Synthesis Using Time-Delay”, Proceedings of the European Signal Processing Conference, 1996.
T. Masuko et al, “Text-to-Visual Speech Synthesis Based on Parameter Generation From HMM”, IEEE 1998, pp. 3745-3748.
T. Chen, “Audio-Visual Integration in Multimodal Communication”, Proceedings of the IEEE, vol. 86, No. 5, May 1998.

