Multimodal identification and tracking of speakers in video

Image analysis – Image transformation or preprocessing – Image storage or retrieval

Reexamination Certificate

Details

Date: 2011-04-05
Examiner: Rahmjoo, Mike (Department: 2624)
Classification: Image analysis – Image transformation or preprocessing – Image storage or retrieval
Class codes: C382S284000, C382S286000, C382S294000, C382S307000, C715S716000, C715S719000, C715S720000, C715S722000, C715S723000, C725S032000, C725S037000
Type: Reexamination Certificate
Status: active
Patent number: 07920761
ABSTRACT:
A computer program product includes machine readable instructions for providing enhanced video output by: receiving footage including likeness information in a plurality of modalities; demultiplexing the plurality of modalities to provide information for each modality; comparing information from at least two of the modalities for determining a correlation in the likeness information; using the correlation, obtaining semantic information for association with the likeness; and combining the semantic information with the likeness information for providing the enhanced video output. A system for implementing the computer program product includes resources for receiving the footage.
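
The steps named in the abstract (demultiplex the modalities, correlate two of them, attach semantic information to the likeness) can be illustrated with a minimal Python sketch. It is not the patented method: the frame layout, the field names `audio_energy` and `mouth_motion`, the function names, the 0.5 threshold, and the use of Pearson correlation as the comparison are all assumptions made here for illustration, and the speaker name is assumed to come from some separate audio-based speaker recognizer.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("sequences must be non-empty and of equal length")
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def demultiplex(footage):
    """Split footage (a list of per-frame dicts) into one stream per modality.

    Hypothetical frame layout:
        {"audio_energy": float, "mouth_motion": {face_id: float, ...}}
    """
    audio = [frame["audio_energy"] for frame in footage]
    face_ids = list(footage[0]["mouth_motion"]) if footage else []
    video = {fid: [frame["mouth_motion"][fid] for frame in footage]
             for fid in face_ids}
    return audio, video


def label_speaker(footage, speaker_name, threshold=0.5):
    """Attach a name (semantic information) to the face whose mouth motion
    correlates best with the audio track, or return None if no face
    reaches the threshold."""
    audio, video = demultiplex(footage)
    scores = {fid: pearson(audio, motion) for fid, motion in video.items()}
    if not scores:
        return None
    best = max(scores, key=scores.get)
    if scores[best] < threshold:
        return None
    # Combine the semantic information with the likeness information.
    return {"face": best, "name": speaker_name, "score": scores[best]}


# Toy footage: face_a moves in step with the audio, face_b does not move.
footage = [
    {"audio_energy": 0.1, "mouth_motion": {"face_a": 0.0, "face_b": 0.5}},
    {"audio_energy": 0.9, "mouth_motion": {"face_a": 1.0, "face_b": 0.5}},
    {"audio_energy": 0.2, "mouth_motion": {"face_a": 0.1, "face_b": 0.5}},
    {"audio_energy": 0.8, "mouth_motion": {"face_a": 0.9, "face_b": 0.5}},
]
result = label_speaker(footage, "Jane Doe")
```

In this toy input the label lands on `face_a`, because the other face's motion signal is constant and therefore scores zero. A real system would work on extracted audio and image features rather than hand-written numbers.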

REFERENCES:
patent: 4449189 (1984-05-01), Feix et al.
patent: 5625704 (1997-04-01), Prasad
patent: 6317716 (2001-11-01), Braida et al.
patent: 2003/0198256 (2003-10-01), Wang et al.
patent: 2005/0047664 (2005-03-01), Nefian et al.
patent: 2006/0059120 (2006-03-01), Xiong et al.
patent: 2006/0204060 (2006-09-01), Huang et al.
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sisson, A. Mashari, and J. Zhou, “Audio-visual speech recognition,” CLSP Summer Workshop Tech. Rep. WS00AVSR, Johns Hopkins University, Baltimore, MD, 2000.
John Hershey and Javier Movellan, “Using audio-visual synchrony to locate sounds,” in Proc. NIPS, 1999. www.cs.cmu.edu/Groups/NIPS/NIPS99/99papers-pub-on-web/Named-gz/HersheyMovellan.ps.gz.
John W. Fisher, Trevor Darrell, William Freeman and Paul Viola, “Learning Joint Statistical Models for Audio-Visual Fusion and Segregation”, Advances in Neural Information Processing Systems, Denver, Colorado, Nov. 28-Dec. 2, 2000.
Iyengar, G., Nock, H.J., Neti, C., “Audio-visual synchrony for detection of monologues in video archives”, ICME 2003.
A. Smeaton, P. Over and W. Kraaij, “TRECVID: evaluating the effectiveness of information retrieval tasks on digital video”, Proceedings of the 12th annual ACM international conference on Multimedia, pp. 652-655, 2004, ISBN:1-58113-893-8.
Martin, A., and M. Przybocki. 2000. The NIST 1999 speaker recognition evaluation: An overview, Digital Signal Processing vol. 10, pp. 1-18.
Face Detection: A Survey, Erik Hjelmås, Computer Vision and Image Understanding 83, 236-274 (2001).

Affiliated with

Amir Arnon

Inventor

Iyengar Giridharan

Inventor

Zilca Ran D.

Inventor

Also associated with

Cantor & Colburn LLP

Law Firm

Dougherty Ann

Attorney

International Business Machines Corporation

Corporate Assignee

Rahmjoo Mike

Examiner
