Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2006-06-27
2006-06-27
Harper, V. Paul (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S258000, C704S276000
Reexamination Certificate
active
07069214
ABSTRACT:
A library of mouth shapes is created by separating speaker-dependent and speaker independent variability. Preferably, speaker dependent variability is modeled by a speaker space while the speaker independent variability (i.e. context dependency), is modeled by a set of normalized mouth shapes that need be built only once. Given a small amount of data from a new speaker, it is possible to construct a corresponding mouth shape library by estimating a point in speaker space that maximizes the likelihood of adaptation data and by combining speaker dependent and speaker independent variability. Creation of talking heads is simplified because creation of a library of mouth shapes is enabled with only a few mouth shape instances. To build the speaker space, a context independent mouth shape parametric representation is obtained. Then a supervector containing the set of context-independent mouth shapes is formed for each speaker included in the speaker space. Dimensionality reduction is used to find the areas of the speaker space.
REFERENCES:
patent: 5608839 (1997-03-01), Chen
patent: 6112177 (2000-08-01), Cosatto et al.
patent: 6188776 (2001-02-01), Covell et al.
patent: 2003/0072482 (2003-04-01), Brand
Bregler et al. “Video Rewrite: Driving Visual Speech with Audio,” AVSP, 1997, pp. 153-156.
Ezzat et al. “MikeTalk: A Talking Facial Display Based on Morphing Visemes,” Proc. of the Computer Animation Conference, Philadelphia, Pa., Jun. 1998.
Shih et al. “Efficient Adaptation of TTS Duration Model to New Speakers,” ICSLP, 1998.
Bregler et al., “Video Rewrite: Driving Visual Speech with Audio” Proc. ACM SIGGRAPH 1997, in Computer Graphics Preceedings, Annual Conference Series, 1997.
Bregler et al., “Video Rewrite: Visual Speech Synthesis from Video” Proc. of the AVSP '97 Workshop, Rhodes (Greece), Sep. 26-27, 1997.
Harness Dickey & Pierce PLC
Harper V. Paul
Matsushita Electric - Industrial Co., Ltd.
LandOfFree
Factorization for generating a library of mouth shapes does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Factorization for generating a library of mouth shapes, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Factorization for generating a library of mouth shapes will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3638978