Method and apparatus for inducing classifiers for multimedia...

Data processing: artificial intelligence – Neural network – Learning task

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06892193

ABSTRACT:
This invention is a system and method to perform categorization (classification) of multimedia items. These items are comprised of a multitude of disparate information sources, in particular, visual information and textual information. Classifiers are induced based on combining textual and visual feature vectors. Textual features are the traditional ones, such as, word count vectors. Visual features include, but are not limited to, color properties of key intervals and motion properties of key intervals. The visual feature vectors are determined in such a fashion that the vectors are sparse. The vector components are features such as the absence or presence of the color green in spatial regions and the absence or the amount of visual flow in spatial regions of the media items. The text and the visual representation vectors are combined in a systematic and coherent fashion. This vector representation of a media item lends itself to well-established learning techniques. The resulting system, subject of this invention, categorizes (or classifies) media items based both on textual features and visual features.

REFERENCES:
patent: 5151788 (1992-09-01), Blum
patent: 5343251 (1994-08-01), Nafeh
patent: 5428733 (1995-06-01), Carr
patent: 5434965 (1995-07-01), Matheny et al.
patent: 5696866 (1997-12-01), Iggulden et al.
patent: 5708477 (1998-01-01), Forbes et al.
patent: 5794178 (1998-08-01), Caid et al.
patent: 5924077 (1999-07-01), Beach et al.
patent: 5983237 (1999-11-01), Jain et al.
patent: 6055043 (2000-04-01), Chambers
patent: 6055548 (2000-04-01), Comer et al.
patent: 6138130 (2000-10-01), Adler et al.
patent: 6173275 (2001-01-01), Caid et al.
patent: 6363380 (2002-03-01), Dimitrova
patent: 6611842 (2003-08-01), Brown
patent: 20020004793 (2002-01-01), Keith
patent: 20020147782 (2002-10-01), Dimitrova et al.
patent: 20020161747 (2002-10-01), Li et al.
patent: 20030004966 (2003-01-01), Bolle et al.
patent: 20030028534 (2003-02-01), Crivella et al.
patent: 20030033347 (2003-02-01), Bolle et al.
Dynamic behavior analysis in compressed fingerprint videos Dorai, C.; Ratha, N.K.; Bolle, R.M.; Circuits and Systems for Video Technology, IEEE Transactions on ,vol.: 14 , Issue: 1 , Jan. 2004 pp.: 58-73.*
Biometrics:The future of identification Pankanti, S.; Bolle, R.M.; Jain, A.; Computer ,vol.: 33 , Issue: 2 , Feb. 2000 □□pp.: 46-49.*
On geometric hashing and the generalized Hough transform Hecker, Y.C.; Bolle, R.M.; Systems, Man and Cybernetics, IEEE Transactions on ,vol.:24 , Issue: 9 , Sep. 1994 pp.: 1328-1338.*
The multiple window parameter transform Califano, A.; Bolle, R.M.; Pattern Analysis and Machine Intelligence, IEEE Transactions on ,vol.: 14 ,Issue: 12 , Dec. 1992 pp.: 1157-1170.*
Yao Wang, Zhu Liu, and Jin-Cheng Huang, “Multimedia Content Analysis” Using Both Audio and Video Vlues, IEEE Signal Processing Magazine, Nov. 2000, pps. 12-36.*
A complete and extendable approach to visual recognition Bolle, R.M.; Califano, A.; Kjeldsen, R.; Pattern Analysis and Machine Intelligence, IEEE Transactions on ,vol.: 14 , Issue: 5 , May 1992 pp.: 534-548.*
VeggieVision: a produce recognition system Bolle, R.M.; Connell, J.H.; Haas, N.; Mohan, R.; Taubin, G.; Applications of Computer Vision, 1996. WACV '96., Proceedings 3rd IEEE Workshop on , Dec. 2-4 1996 pp.: 244-251.*
Is there any texture in the image? Karu, K.; Jain, A.K.; Bolle, R.M.; Pattern Recognition, 1996., Proceedings of the 13th International Conference on ,vol.: 2 , Aug. 25-29 1996 pp.: 770-774 vol. 2.*
Comparison of distance measures for video copy detection Hampapur, A.; Bolle, R.M.; Multimedia and Expo, 2001. ICME 2001. IEEE International Conference on , Aug. 22-25, 2001 pp.: 737-740.*
Representing and comparing shapes using shape polynomials Taubin, G.; Bolle, R.M.; Cooper, D.B.; Computer Vision and Pattern Recognition, 1989. Proceedings CVPR '89., IEEE Computer Society Conference on , Jun. 4-8, 1989 pp.: 510-516.*
Content-based digital video retrieval Bolle, R.M.; Yeo, B.L.; Yeung, M.M.; Broadcasting Convention, 1997. International , Sep. 12-16, 1997 pp.: 160-165.*
Data and model driven foveation Califano, A.; Kjeldsen, R.; Bolle, R.M.; Pattern Recognition, 1990. Proceedings., 10th International Conference on ,vol.: i , Jun. 16-21, 1990 pp.: 1-7 vol. 1.*
Smith et al., “Video Skimming and Characterization Through the Combination of Image and Language Understanding Techniques,” CMU-CS-97-11, pp. 1-11 (1997).
Local Color Detection Applied to TV Commercial Recognition, Third International Conference, Visual ‘99’, Amsterdam, Jun. 1999, pp. 237-244, by J.M. Sanchez, X. Binega, J. Vitria and P. Radeva.
“Veggie Vision: A Produce Recognition System,” in Proc. Third IEEE Workshop on Applications of Computer Vision, pp. 244-251, Dec. 1996 b R. Bolle, J. Conell, G. Taubin.
Finding Pirated Video Sequences on the Internet. tech.rep., Stanford Infolab, Feb. 1999, by P. Indyk, G. Iyengar and N. Shivakumar.
“Video Sequence Matching”, International Conference on Acoustics, Speech and Signal Processing, (ICASSP), May, 1998, by R. Mohan.
Feature Based Indexing For Media Tracking. In Proc. of Int. Conf. on Multimedia and Expo, Aug. 2000, pp. 67-70 by R.M. Bolle and A. Hampapur.
“On the Detection and Recognition of Television Commercials.” In Proc. of the IEEE Conf. on Multimedia Computing and Systems, 1997, by C. Lienhart, C. Kuhmunch and W. Effelsburg.
“Video Classification Based on HMM Using Text and Faces,” (Dimitrova, et al).
“Video Skimming For Quick Browsing Based on Audio and Image Characterization”, Carnegie Mellon University, Tech Rep. CMU-CS-95-186, Jun. 1995 (Smith, et al) by M.A. Smith and T. Kanade.
“Towards Content-Based Browsing of Broadcast News Video,” in Intelligent Multimedia Info Retrieval, Issue 197, M.T. Maybury (ed.), pp. 241-258. AAAI Press/The MIT Press (Mani, et al), by I. Mani, D. House, M. Maybury, M. Green.
“Speech Transcript Analysis For Automatic Search,” IBM Research Tech Rep., by A.R. Coden, E.W. Brown.
“Multi-Paragraph Segmentation of Expository Text.” Proceedings of the 32nd Annual Conference of the Association for Computational Linguistics, Las Cruces, NM 1994, pp. 9-16, by M.A. Hearst.
“An Automated Method of Finding Topic Boundaries”, Proceedings of the 32nd Annual Conference of the Association for Computational Linguistics, Student Session, Las Cruces, NM 1994, pp. 331-333.
Text Segmentation by Topic, in Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, 1997, pp. 120-129, J.M. Ponte and W.B. Croft.
“Text Based on Similarities Between Words”, In Proceedings of the 31st Annual Conference of the Association for Computational Linguistics, Columbus, OH, pp. 286-288, 1993, by H. Kozima.
Combining Knowledge Sources For Discourse Segmentation. In Proceedings of the 33 Annual Conference of the Association for Computational Linguistics, Cambridge, M.A., by Litman, D.J. and Passoneau R.J. 1995.
Text Segmentation Using Exponential Models, CMU Tech Rep. by D. Beeferman, A. Berger and J. Lafferty.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for inducing classifiers for multimedia... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for inducing classifiers for multimedia..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for inducing classifiers for multimedia... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3452421

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.