Audio-visual selection process for the synthesis of...

Computer graphics processing and selective visual display system – Computer graphics processing – Animation

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C345S956000, C345S957000

Reexamination Certificate

active

07990384

ABSTRACT:
A system and method for generating photo-realistic talking-head animation from a text input utilizes an audio-visual unit selection process. The lip-synchronization is obtained by optimally selecting and concatenating variable-length video units of the mouth area. The unit selection process utilizes the acoustic data to determine the target costs for the candidate images and utilizes the visual data to determine the concatenation costs. The image database is prepared in a hierarchical fashion, including high-level features (such as a full 3D modeling of the head, geometric size and position of elements) and pixel-based, low-level features (such as a PCA-based metric for labeling the various feature bitmaps).

REFERENCES:
patent: 4827532 (1989-05-01), Bloomstein
patent: 5657426 (1997-08-01), Waters et al.
patent: 5880788 (1999-03-01), Bregler
patent: 6072496 (2000-06-01), Guenter et al.
patent: 6208356 (2001-03-01), Breen et al.
patent: 6285794 (2001-09-01), Georgiev et al.
patent: 6449595 (2002-09-01), Arslan et al.
patent: 6496594 (2002-12-01), Prokoski
Tony Ezzat and Tomaso Poggio, “Visual Speech Synthesis by Morphing Visemes”, Inter. Journal of Computer Vision 38(1), 45-57 2000, ACM.
Jiang et al., “Visual Speech Analysis and Synthesis with Application to Mandarin Speech Training”, Proceedings of ACM Symposium of Virtual Reality Software and Tech, UK 1999, pp. 111-115.
Matthew Brand, “Voice Puppetry”, ACM SIGGRAPH 99, Los Angeles, CA, pp. 21-28.
H. Hon, A. Acero, X. Huang, J. Liu, and M. Plumpe, “Automatic Generation of Synthesis Units for Trainable Text-To-Speech Systems”, Microsoft Corp. 1998, Pub in: Acoustics, Speech, and Signal Processing, 1998, ICASSP '98. Proc of 1998 IEEE International Conference on, vol. 1, pp. 293-296.
R. V. Cox, et al., “Speech and language processing for next-millenium communications services,” Proc. IEEE, vol. 88, pp. 1314-1337, Aug. 2000.
Tony Ezzat and Tomaso Poggio, “Visual Speech Synthesis by Morphing Visemes”—1999 Version, Artificial Intelligence Laboratory, A.I. Memo No. 1658, C.B.C.L Paper No. 173, May 1999.
Christopher Bregler, Michelle Covell, Malcolm Slaney, “Video Re-write: Driving Visual Speech with Audio”, Siggraph, 97, Los Angeles, CA, Aug. 3-8, 1997.
Andrew Hunt, Alan W. Black, “Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database”, ATR Interpreting Telecommunication Research Labs, (continued from previous listing) to appear in Proc. ICASSP-96, May 7-10, Atlanga GA.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Audio-visual selection process for the synthesis of... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Audio-visual selection process for the synthesis of..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio-visual selection process for the synthesis of... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2755786

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.