Multimedia data management by speech recognizer annotation

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S239000, C704S243000, C707S793000

Reexamination Certificate

active

07739110

ABSTRACT:
A method and an apparatus for multimedia data management are disclosed. The method provides an indexing and retrieval scheme for digital photos with speech annotations based on image-like patterns transformed from the recognized syllable candidates. For annotated spoken content, the recognized n-best syllable candidates are transformed into a sequence of syllable-transformed patterns. Eigen-image analysis is further adopted to extract the significant information to reduce the dimensionality. Vector quantization is applied to quantize the syllable-transformed patterns into feature vectors for indexing. The invention of indexing scheme reduces the dimensionality and noise of data, and achieves better performance of 16.26% for speech annotated photo retrieval.

REFERENCES:
patent: 4087630 (1978-05-01), Browning et al.
patent: 4624011 (1986-11-01), Watanabe et al.
patent: 4677672 (1987-06-01), Ukita et al.
patent: 4718092 (1988-01-01), Klovstad
patent: 4718093 (1988-01-01), Brown
patent: 4903305 (1990-02-01), Gillick et al.
patent: 4903306 (1990-02-01), Nakamura
patent: 5532936 (1996-07-01), Perry
patent: 5625749 (1997-04-01), Goldenthal et al.
patent: 5679001 (1997-10-01), Russell et al.
patent: 5835667 (1998-11-01), Wactlar et al.
patent: 6061652 (2000-05-01), Tsuboka et al.
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6243713 (2001-06-01), Nelson et al.
patent: 6397181 (2002-05-01), Li et al.
patent: 6449595 (2002-09-01), Arslan et al.
patent: 6499016 (2002-12-01), Anderson
patent: 6542869 (2003-04-01), Foote
patent: 6684185 (2004-01-01), Junqua et al.
patent: 6813618 (2004-11-01), Loui et al.
patent: 6833865 (2004-12-01), Fuller et al.
patent: 7171360 (2007-01-01), Huang et al.
patent: 7181398 (2007-02-01), Thong et al.
patent: 7366656 (2008-04-01), Furst-Yust et al.
patent: 7574360 (2009-08-01), Wu et al.
patent: 2002/0038294 (2002-03-01), Matsugu
patent: 2003/0177108 (2003-09-01), Charlesworth et al.
patent: 2006/0095264 (2006-05-01), Wu et al.
patent: 2007/0174055 (2007-07-01), Chengalvarayan et al.
patent: 2009/0157402 (2009-06-01), Lin et al.
patent: 2002-49559 (2002-02-01), None
patent: 2006-58874 (2006-03-01), None
Visually Searching the Web for Content; John R. Smith and Shih-Fu Chang Jul.-Sep. 1997 IEEE; p. 12-20.
An Active Learning Framework for Content-Based Information Retrieval; Cha Zhang, Student Member, IEEE, and Tsuhan Chen, Member, IEEE IEEE Transactions on Multimedia, vol. 4, No. 2, Jun. 2002; p. 260-268.
Personalized Video Summary Using Visual Semantic Annotations and Automatic Speech Transcriptions Belle L. Tseng and Ching-Yung Lin; 2002 IEEE, p. 5-8.
Building Personal Digital Photograph Libraries: An Approach with Ontology-Based MPEG-7 Dozen Dimensional Digital Content Architectur; Pei-Jeng Kuo Terumasa Aoki Hiroshi Yasuda; Proceedings of the Computer Graphics International; 2004 IEEE; p. 1-5.
Semantic Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns Shi Lu, Michael R. Lyu and Irwin King; Proceedings of the 11thInternational Multimedia Modeling Conference(MMM'05); 2005 IEEE.
Transcriber: Development and use of a tool for assisting speech corpora production; Claude Barras, Edouard Geoffrois, Zhibiao Wu, Mark Liberman; Accepted Aug. 2, 2000; 2001 Elsevier Science B. V.; p. 5-22.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Multimedia data management by speech recognizer annotation does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Multimedia data management by speech recognizer annotation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multimedia data management by speech recognizer annotation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4196657

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.