Multimodal access of meeting recordings

Image analysis – Image transformation or preprocessing – Image storage or retrieval

Reexamination Certificate

Details

C707S793000

active

10307235

ABSTRACT:
A meeting recorder captures multimodal information of a meeting. Subsequent analysis of the information produces scores indicative of visually and aurally significant events that can help identify significant segments of the meeting recording. Textual analysis can enhance searching for significant meeting segments and otherwise enhance the presentation of the meeting segments.
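
The abstract does not say how the scores are computed or combined, so the following is only a minimal sketch of the general idea, not the patented method. It assumes hypothetical per-interval visual and audio scores in the range 0 to 1, combines them with an assumed weighted average, and reports runs of intervals whose combined score reaches an assumed threshold. The function name, the weight, and the threshold are all illustrative.

```python
from typing import List, Tuple


def significant_segments(
    visual: List[float],
    audio: List[float],
    threshold: float = 0.5,   # assumed cutoff, not taken from the patent
    w_visual: float = 0.5,    # assumed weight of the visual score
) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges of significant intervals."""
    if len(visual) != len(audio):
        raise ValueError("visual and audio score lists must have equal length")

    # Combine the two modalities with a simple weighted average (an assumption).
    combined = [w_visual * v + (1.0 - w_visual) * a for v, a in zip(visual, audio)]

    segments: List[Tuple[int, int]] = []
    start = None
    for i, score in enumerate(combined):
        if score >= threshold and start is None:
            start = i                      # a significant run begins
        elif score < threshold and start is not None:
            segments.append((start, i))    # the run ended at the previous interval
            start = None
    if start is not None:
        segments.append((start, len(combined)))  # run extends to the end
    return segments


if __name__ == "__main__":
    visual_scores = [0.1, 0.9, 0.8, 0.1, 0.0, 0.7]
    audio_scores = [0.1, 0.7, 0.6, 0.2, 0.0, 0.9]
    print(significant_segments(visual_scores, audio_scores))
```

Each returned pair marks a run of consecutive intervals that a browser could offer as a candidate segment of the recording. A real system would likely smooth the scores and merge nearby runs; those steps are omitted here.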

REFERENCES:
patent: 5664227 (1997-09-01), Mauldin et al.
patent: 5680481 (1997-10-01), Prasad et al.
patent: 5754938 (1998-05-01), Herz et al.
patent: 5835616 (1998-11-01), Lobo et al.
patent: 2003/0018475 (2003-01-01), Basu et al.
B. Kapralos, M. Jenkin, E. Milios, J. Tsotsos, “Eyes 'n Ears: Face Detection Utilizing Audio And Video Cues”, Proc. 2nd Int. Workshop on Recognition, Analysis and Tracking of Faces and Gestures in Real-Time Systems (RATFG-RTS 2001), 2001.
S. Nayar, “Omnidirectional video camera”, In Proceedings of the 1997 DARPA Image Understanding Workshop, May 1997.
Vahedian, A., Frater, M., Arnold, J., Cavenor, M., Godara, L., Pickering, M., “Estimation of Speaker Position using Audio Information”, TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE, 1997.
Aramvith, S., Sun, M.T., “MPEG-1 and MPEG-2 Video Standards”, Image and Video Processing Handbook, Academic Press, 1999.
Hauptmann, A.G., Smith, M.A., “Text, Speech, and Vision for Video Segmentation: The Informedia Project”, 1995, Proceedings of the AAAI Fall Symposium on Computation.
Yeo, B.L., Liu, B., “Rapid Scene Analysis on Compressed Video”, Circuits and Systems for Video Technology, IEEE Transactions on, 1995.
Smith, M.A.; Kanade, T.; “Video Skimming and Characterization through the Combination of Image and Language Understanding”, IEEE International Workshop on Content-based Access of Image and Video Databases (ICCV98-Bombay, India).
Abdel-Mottaleb, M. and Elgammal, A. (1999). “Face Detection in complex environments from color images,” IEEE ICIP 622-626.
Arons, B. (1997). “SpeechSkimmer: A system for interactively skimming recorded speech,” ACM Transactions on Computer-Human Interaction 4(1):3-38.
Divakaran, A. et al. (2000). “Video browsing system based on compressed domain feature extraction,” IEEE Transactions on Consumer Electronics 46:637-644.
Dorai, C. and Kobla, V. (1999). “Perceived visual motion descriptors from MPEG-2 for content-based HDTV annotation and retrieval,” IEEE 3rd workshop on Multimedia Signal Processing, 147-152.
Erol, B. and Kossentini, F. (2001). “Local motion descriptors,” IEEE Workshop on Multimedia Signal Processing, 467-472.
Foote, J. and Kimber, D. (2000). “FlyCam: Practical panoramic video and automatic camera control,” Proceedings of International Conference on Multimedia & Expo, 3:1419-1422.
Foote, J. et al. (1998). “An Intelligent Media Browser Using Automatic Multimodal Analysis,” ACM Multimedia, 375-380.
Foote, J. et al. (1999). “Finding presentations in recorded meetings using audio and video features,” ICASSP, 3029-3032.
Gross, R. et al. (2000). “Towards a Multimodal meeting record,” Proceedings of International Conference on Multimedia and Expo, 1593-1596.
Gross, R. et al. (2000). “Face Recognition in a Meeting Room,” IEEE International Conference on Automatic Face and Gesture Recognition, 294-299.
Hauptmann, A.G. and Smith, M. (1995). “Text, speech, and vision for video segmentation: The Informedia™ project,” Proceedings of the AAAI Fall Symposium on Computation.
Hsu, R.L. et al. (2001). “Face detection in color images,” Proc. International Conference on Image Processing, 1046-1049.
Johnson, S.E. (1999). “Who spoke when? -Automatic Segmentation and Clustering for Determining Speaker Turns,” Proc. of Eurospeech, 679-682.
Kapralos, B. et al. (2001). “Eyes 'n Ears Face Detection,” 2001 International Conference on Image Processing, 1:66-69.
Kimber, D. and Wilcox, L. (1996). “Acoustic segmentation for audio browsers,” in Proc. Interface Conference, Sydney, Australia, 10 pages.
Lee, D. et al. (2002). “Segmenting People in Meeting Videos Using Mixture Background and Object Models,” Proc. of Pacific Rim Conf. on Multimedia, Taiwan, Dec. 16-18, 8 pages total.
Maybury, M. et al. (1997). “Segmentation, content extraction and visualization of broadcast news video using multistream analysis,” AAAI, 12 pages total.
Myers, B.A. et al. (2001). “A Multi-view intelligent editor for digital video libraries,” Joint Conference on Digital Libraries, Roanoke, VA, Jun. 24-28, 10 pages total.
Pfau, T. et al. (2001). “Multispeaker Speech Activity Detection for the ICSI Meeting Recorder,” Proc. IEEE Automatic Speech Recognition and Understanding Workshop, 4 pages total.
Pingali, G., et al. (2001). “Multimedia retrieval through spatio-temporal activity maps,” ACM Multimedia 129-136.
Rui, Y. et al. (2001). “Viewing meetings captured by an omni-directional camera,” ACM CHI 2001, Seattle, Mar. 31-Apr. 4, 2001, 450-457.
Stauffer, C. and Grimson, W.E.L. (1999). “Adaptive Background Mixture Models for Real-Time Tracking,” Proceedings of Computer Vision and Pattern Recognition, 246-252.
Sun, X. et al. (2001). “A Motion activity descriptor and its extraction in compressed domain,” Proc. IEEE Pacific-Rim Conference on Multimedia (PCM '01) 450-457.
Sun, X. et al. (2001). “Panoramic video capturing and compressed domain virtual camera control,” ACM Multimedia 229-238.
Tritschler, A. and Gopinath, R. (1999). “Improved Speaker Segmentation and Segments Clustering using the Bayesian Information Criterion,” Proc. of Eurospeech, 679-682.
Waibel, A. et al. (2001). “Advances in automatic meeting record creation and access,” Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 597-600.
Yang, J. et al. (1999). “Multimodal People ID for a Multimedia Meeting Browser,” Proceedings of ACM Multimedia 159-168.
Yang, M.H. et al. (2002). “Detecting Faces in Images: A Survey,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(1):34-58.
