Face recognition from video images

Image analysis – Applications – Target tracking or detecting

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S117000, C382S118000, C382S190000, C382S209000, C382S276000, C342S090000, C348S169000

Reexamination Certificate

active

06301370

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to vision-based object detection and tracking, and more particularly, to systems for detecting objects in video images, such as human faces, and tracking and identifying the objects in real time.
BACKGROUND OF THE INVENTION
Recently developed object and face recognition techniques include the use of elastic bunch graph matching. The bunch graph recognition technique is highly effective for recognizing faces when the image being analyzed is segmented such that the face portion of the image occupies a substantial portion of the image. However, the elastic bunch graph technique may not reliably detect objects in a large scene where the object of interest occupies only a small fraction of the scene. Moreover, for real-time use of the elastic bunch graph recognition technique, the process of segmenting the image must be computationally efficient or many of the performance advantages of the recognition technique are not obtained.
Accordingly, there exists a significant need for an image processing technique for detecting an object in video images and preparing the video image for further processing by an bunch graph matching process in a computationally efficient manner. The present invention satisfies these needs.
SUMMARY OF THE INVENTION
The present invention is embodied in an apparatus, and related method, for detecting and recognizing an object in an image frame. The object detection process uses robust and computationally efficient techniques. The object identification and recognition process uses an image processing technique based on model graphs and bunch graphs that efficiently represent image features as jets. The system of the invention is particularly advantageous for recognizing a person over a wide variety of pose angles.
In an embodiment of the invention, the object is detected and a portion of the image frame associated with the object is bounded by a bounding box. The bound portion of the image frame is transformed using a wavelet transformation to generate a transformed image. Nodes associated with distinguishing features of the object defined by wavelet jets of a bunch graph generated from a plurality of representative object images are located on the transformed image. The object is identified based on a similarity between wavelet jets associated with an object image in a gallery of object images and wavelet jets at the nodes on the transformed image.
Additionally, the detected object may be sized and centered within the bound portion of the image such that the detected object has a predetermined size and location within the bound portion and background portions of the bound portion of the image frame not associated with the object prior to identifying the object may be suppressed. Often, the object is a head of a person exhibiting a facial region. The bunch graph may be based on a three- dimensional representation of the object. Further, the wavelet transformation may be performed using phase calculations that are performed using a hardware adapted phase representation.
In an alternative embodiment of the invention, the object is in a sequence of images and the step of detecting an object further includes tracking the object between image frames based on a trajectory associated with the object. Also, the step of locating the nodes includes tracking the nodes between image frames and reinitializing a tracked node if the node's position deviates beyond a predetermined position constraint between image frames. Additionally, the image frames may be stereo images and the step of detecting may include detecting convex regions which are associated with head movement.
Other features and advantages of the present invention should be apparent from the following description of the preferred embodiments, taken in conjunction with the accompanying drawings, which illustrate, by way of example, the principles of the invention.


REFERENCES:
patent: 4725824 (1988-02-01), Yoshioka
patent: 4805224 (1989-02-01), Koezuka et al.
patent: 4827413 (1989-05-01), Baldwin et al.
patent: 5159647 (1992-10-01), Burt
patent: 5168529 (1992-12-01), Peregrim et al.
patent: 5187574 (1993-02-01), Kosemura et al.
patent: 5220441 (1993-06-01), Gerstenberger
patent: 5280530 (1994-01-01), Trew et al.
patent: 5333165 (1994-07-01), Sun
patent: 5383013 (1995-01-01), Cox
patent: 5430809 (1995-07-01), Tomitaka
patent: 5432712 (1995-07-01), Chan
patent: 5511153 (1996-04-01), Azarbayejani et al.
patent: 5533177 (1996-07-01), Wirtz et al.
patent: 5550928 (1996-08-01), Lu et al.
patent: 5581625 (1996-12-01), Connell
patent: 5588033 (1996-12-01), Yeung
patent: 5680487 (1997-10-01), Markandey
patent: 5699449 (1997-12-01), Javidi
patent: 5714997 (1998-02-01), Anderson
patent: 5715325 (1998-02-01), Bang et al.
patent: 5719954 (1998-02-01), Onda
patent: 5736982 (1998-04-01), Suzuki et al.
patent: 5764803 (1998-06-01), Jacquin et al.
patent: 5774591 (1998-06-01), Black et al.
patent: 5802220 (1998-09-01), Black et al.
patent: 5809171 (1998-09-01), Neff et al.
patent: 5828769 (1998-10-01), Burns
patent: 5917937 (1999-06-01), Szeliski et al.
patent: 5982853 (1999-11-01), Liebermann
patent: 5995119 (1999-11-01), Cosatto et al.
patent: 6044168 (2000-03-01), Tuceryan et al.
patent: 6052123 (2000-04-01), Lection et al.
patent: 44 06 020 C1 (1995-06-01), None
patent: 0 807 902 A2 (1997-11-01), None
Sara, R. et al “3-D Data Acquision and Interpretation for Virtual Reality and Telepresence”, Proceedings IEEE Workshop Computer Vision for Virtual Reality Based Human Communication, Bombay, Jan. 1998, 7 pp.
Sara, R. et al “Fish-Scales: Representing Fuzzy Manifolds”, Proceedings International Conference Computer Vision, ICCV '98, pp. 811-817, Bombay, Jan. 1998.
Wiskott, L. “Phantom Faces for Face Analysis”,Pattern Recognition, vol. 30, No. 6, pp. 837-846, 1997.
Wurtz, R., “Object Recognition Robust Under Translations, Deformations, and Changes in Background”,IEEE Transactions on Patern Analysis and Machine Intelligence, vol. 19, No. 7, Jul. 1997, pp. 769-775.
Akimoto, T., et al, “Automatic Creation of 3-D Facial Models”,IEEE Computer Graphics & Applications., pp. 16-22, Sep. 1993.
Ayache, N., et al, “Rectification of Images for Binocular and Trinocular Stereovision”, In IEEE Proceedings of 9th International Conference on Pattern Recognition, pp. 11-16, 1988, Italy.
Belhumeur, P., “A Bayesian Approach to Binocular Stereopsis”,International Journal of Computer Vision, 19 (3), 1996, pp.237-260.
Beymer, D. J., “Face Recognition Under Varying Pose”, Massachusettes Institute of Technology, Artificial Intelligence Laboratory, A.I. Memo No. 1461, 12/93, pp. 1-13.
Beymer, D. J., “Face Recognition Under Varying Pose”, Massachusetts Institute of Technology, Artificial Intelligence Laboratory research report, 1994, pp. 756-761.
Buhmann, J. et al, “Distortion Invariant Object Recognition by Matching Hierarchically Labeled Graphs”, In Proceedings IJCNN International Conference of Neural Networks, Wash., DC, Jun. 1989, pp. 155-159.
DeCarlo, D., et al, “The Integration of Optical Flow and Deformable Models with Applications to Human Face Shape and Motion Estimation”, pp. 1-15, In Proceedings,CVPR '96, pp. 231-238.
Devernay, F., et al, “Computing Differential Properties of 3-D Shapes from Stereoscopic Images without 3-D Models”,INRIA, RR-2304, 1994, pp. 1-28.
Dhond, U., et al, “Structure from Stereo-A Review”,IEEE Transactions on Systems, Man, and Cybernetics, vol. 19, No.6, pp. 1489-1510, Nov./Dec. 1989.
Fleet, D. J., et al, “Computation of Component Image Velocity from Local Phase Information”,International Journal of Computer Vision, vol. 5, No. 1, 1990, pp. 77-104.
Fleet, D.J., “Measurement of Image Velocity”, Kluwer International Series in Engineering and Computer Science, Kluwer Academic Publishers, Boston, 1992, No. 169, pp. 1-203.
Hall, E. L., “Computer Image Processing and Recognition”, Academic Press, 1979, pp 468-484.
Hong, H., et al, “Online Facial Recognition Based on Personalized Gallery”, Proceedings of International Conference on Automatic Face and Gesture Recog

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Face recognition from video images does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Face recognition from video images, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Face recognition from video images will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2553658

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.