Method and apparatus for predicting events in video...

Television – Two-way video and voice communication – Display arrangement

Reexamination Certificate

Details

C348S014050, C348S169000, C348S211990

active

06894714

ABSTRACT:
Methods and apparatus are disclosed for predicting events using acoustic and visual cues. The present invention processes audio and video information to identify one or more (i) acoustic cues, such as intonation patterns, pitch and loudness, (ii) visual cues, such as gaze, facial pose, body postures, hand gestures and facial expressions, or (iii) a combination of the foregoing, that are typically associated with an event, such as behavior exhibited by a video conference participant before he or she speaks. In this manner, the present invention allows the video processing system to predict events, such as the identity of the next speaker. The predictive speaker identifier operates in a learning mode to learn the characteristic profile of each participant in terms of the concept that the participant “will speak” or “will not speak” under the presence or absence of one or more predefined visual or acoustic cues. The predictive speaker identifier operates in a predictive mode to compare the learned characteristics embodied in the characteristic profile to the audio and video information and thereby predict the next speaker.
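
The two modes described in the abstract can be illustrated with a minimal sketch. The abstract does not say which learning algorithm builds the characteristic profile, so this example swaps in simple smoothed frequency counting as a stand-in. The class name, method names, cue labels, participant names and the smoothing parameter are all hypothetical and are not taken from the patent.

```python
from collections import defaultdict


class PredictiveSpeakerIdentifier:
    """Toy next-speaker predictor built on per-participant cue counts.

    Assumption: cues arrive as already-detected string labels; the audio
    and video analysis that would produce them is out of scope here.
    """

    def __init__(self, smoothing=1.0):
        if smoothing <= 0:
            raise ValueError("smoothing must be positive")
        self.smoothing = smoothing
        # profiles[participant][cue] -> [times spoke next, times did not]
        self.profiles = defaultdict(lambda: defaultdict(lambda: [0, 0]))
        # totals[participant] -> [times spoke next, times did not]
        self.totals = defaultdict(lambda: [0, 0])

    def learn(self, participant, cues, spoke_next):
        """Learning mode: record which cues preceded speaking or silence."""
        outcome = 0 if spoke_next else 1
        self.totals[participant][outcome] += 1
        for cue in cues:
            self.profiles[participant][cue][outcome] += 1

    def _smoothed(self, counts):
        # Laplace-style estimate of P(will speak) from a pair of counts.
        spoke, silent = counts
        return (spoke + self.smoothing) / (spoke + silent + 2 * self.smoothing)

    def score(self, participant, cues):
        """Estimate how strongly the observed cues suggest 'will speak'."""
        if not cues:
            # No cues observed: fall back to the participant's overall rate.
            return self._smoothed(self.totals.get(participant, [0, 0]))
        profile = self.profiles.get(participant, {})
        estimates = [self._smoothed(profile.get(cue, [0, 0])) for cue in cues]
        return sum(estimates) / len(estimates)

    def predict_next_speaker(self, observations):
        """Predictive mode: return the participant with the highest score."""
        return max(observations, key=lambda p: self.score(p, observations[p]))


# Learning mode on a few hand-made (hypothetical) observations.
identifier = PredictiveSpeakerIdentifier()
for _ in range(3):
    identifier.learn("alice", {"raised_hand", "gaze_at_camera"}, spoke_next=True)
identifier.learn("alice", {"gaze_at_camera"}, spoke_next=False)
for _ in range(2):
    identifier.learn("bob", {"leans_forward"}, spoke_next=True)
    identifier.learn("bob", {"raised_hand"}, spoke_next=False)

# Predictive mode: compare current cues against each learned profile.
current_cues = {"alice": {"raised_hand"}, "bob": {"raised_hand"}}
print(identifier.predict_next_speaker(current_cues))
```

Keeping a separate profile per participant mirrors the abstract's point that the same cue can mean different things for different people: in this toy data a raised hand precedes speech for one participant but not for the other.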

REFERENCES:
patent: 4980761 (1990-12-01), Natori
patent: 5600765 (1997-02-01), Ando et al.
patent: 5844599 (1998-12-01), Hildin
patent: 5940118 (1999-08-01), Van Schyndel
patent: 5959667 (1999-09-01), Maeng
patent: 6005610 (1999-12-01), Pingali
patent: 6072494 (2000-06-01), Nguyen
patent: 6219086 (2001-04-01), Murata
patent: 6275258 (2001-08-01), Chim
patent: 6392694 (2002-05-01), Bianchi
patent: 6496799 (2002-12-01), Pickering
patent: 6593956 (2003-07-01), Potts et al.
patent: 9743857 (1997-11-01), None
patent: WO0182626 (2001-01-01), None
Frank Dellaert et al., “Recognizing Emotion in Speech”, in Proc. of Int'l Conf. on Speech and Language Processing (1996).
Egor Elagin et al., “Automatic Pose Estimation System for Faces based on Bunch Graph Matching Technology”, Proc. of the 3d Int'l Conf. on Automatic Face and Gesture Recognition, vol. I, 136-141, Nara, Japan (Apr. 14-16, 1998).
