Image analysis – Applications – Personnel identification
Patent
1994-11-10
1997-04-29
Couso, Jose L.
Image analysis
Applications
Personnel identification
382115, 382170, G06K 900
Patent
active
056257040
ABSTRACT:
A speaker recognition method uses visual image representations of mouth movements associated with the generation of an acoustic utterance by a speaker that is the person to be recognized. No acoustic data is used and normal ambient lighting conditions are used. The method generates a spatiotemporal gray-level function representative of the spatiotemporal inner month area confined between the lips during the utterance from which a cue-block is generated that isolates the essential information from which a feature vector for recognition is generated. The feature vector includes utterance duration, maximum lip-to-lip separation, and location in time, or speed of lip movement opening, speed of lip movement closure, and a spatiotemporal area measure representative of the area enclosed between the lips during the utterance and representative of the frontal area of the oral cavity during the utterance. Experimental data shows distinct clustering in feature space for different speakers.
REFERENCES:
patent: 4841575 (1989-06-01), Welsh et al.
patent: 4975960 (1990-12-01), Petajan
patent: 4975969 (1990-12-01), Tal
patent: 4975978 (1990-12-01), Ando et al.
patent: 5136659 (1992-08-01), Kaneko et al.
-Ashok Samal et al., "Automatic Recognition and Analysis of Human Faces and Facial Expressions: A Survey," Pattern Recognition, vol. 25, No. 1, pp. 65-77 (1992).
-Harry McGurk et al., "Hearing Lips and Seeing Voices," Nature, vol. 264, pp. 746-748 (Dec. 23/30, 1976).
Bella Matthew C.
Couso Jose L.
Ricoh & Company, Ltd.
Ricoh Corporation
LandOfFree
Speaker recognition using spatiotemporal cues does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speaker recognition using spatiotemporal cues, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speaker recognition using spatiotemporal cues will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-712628