Method and apparatus for indexing speech

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Details

C704S254000, C704S275000, C369S025010, C369S027010

active

07634407

ABSTRACT:
A method of indexing a speech segment includes identifying at least two alternative word sequences based on the speech segment. For each word in the alternative sequences, information is placed in an entry for the word in the index. The information indicates the position of the word in at least one of the alternative sequences.
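The indexing step described in the abstract can be illustrated with a minimal sketch. It assumes a speech recognizer has already produced the alternative word sequences as plain lists of words; the function name `index_segment`, the segment identifier, and the `(segment_id, position)` entry layout are hypothetical choices for illustration and are not taken from the patent.

```python
from collections import defaultdict


def index_segment(segment_id, alternatives, index=None):
    """Add every word of every alternative word sequence to an inverted index.

    Hypothetical layout: each index entry maps a word to a list of
    (segment_id, position) pairs, where position is the word's 0-based
    offset within one of the alternative sequences.
    """
    if index is None:
        index = defaultdict(list)
    for words in alternatives:
        for position, word in enumerate(words):
            posting = (segment_id, position)
            # The same word at the same position in two alternatives
            # is recorded only once.
            if posting not in index[word]:
                index[word].append(posting)
    return index


# Two alternative recognitions of the same speech segment (made-up data).
alternatives = [
    ["recognize", "speech"],
    ["wreck", "a", "nice", "beach"],
]
index = index_segment("seg-1", alternatives)
```

Because both alternatives are indexed, a query for "speech" or for "beach" would find this segment, each with its own position. The sketch stores positions only and makes no attempt to weight the alternatives against each other.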

REFERENCES:
patent: 4783803 (1988-11-01), Baker et al.
patent: 4977598 (1990-12-01), Doddington et al.
patent: 5745899 (1998-04-01), Burrows
patent: 6047283 (2000-04-01), Braun
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6266658 (2001-07-01), Adya et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6374220 (2002-04-01), Kao
patent: 6584458 (2003-06-01), Millett et al.
patent: 6611803 (2003-08-01), Furuyama et al.
patent: 6873993 (2005-03-01), Charlesworth et al.
patent: 6907397 (2005-06-01), Kryze et al.
patent: 7092883 (2006-08-01), Gretter et al.
patent: 7216077 (2007-05-01), Padmanabhan et al.
patent: 7266553 (2007-09-01), Anderson et al.
patent: 7313554 (2007-12-01), Chen et al.
patent: 7379870 (2008-05-01), Belvin et al.
patent: 2002/0052870 (2002-05-01), Charlesworth et al.
patent: 2002/0184196 (2002-12-01), Lehmeier et al.
patent: 2003/0055634 (2003-03-01), Hidaka et al.
patent: 2003/0187643 (2003-10-01), Van Thong et al.
patent: 2003/0187649 (2003-10-01), Logan et al.
patent: 2004/0044952 (2004-03-01), Jiang et al.
patent: 2004/0199385 (2004-10-01), Deligne et al.
patent: 2005/0060139 (2005-03-01), Corston-Oliver et al.
patent: 2005/0159953 (2005-07-01), Seide et al.
patent: 2005/0228671 (2005-10-01), Olorenshaw et al.
patent: 2007/0106509 (2007-05-01), Acero et al.
patent: 2007/0106512 (2007-05-01), Acero et al.
patent: 2007/0143110 (2007-06-01), Acero et al.
patent: 1 043 665 (2000-10-01), None
patent: WO 00/54168 (2000-09-01), None
patent: WO 02/27546 (2002-04-01), None
Kenneth Ward Church, “Speech and Language Processing: Where have we been and where are we going?,” in Proceedings of Eurospeech, Geneva, Switzerland, 2003.
J. Garofolo, G. Auzanne, and E. Voorhees, “The TREC spoken document retrieval track: A success story,” in Proceedings of the Recherche d'Informations Assiste par Ordinateur: Content-Based Multimedia Information Access Conference, Apr. 2000.
M. G. Brown, J. T. Foote, G. J. F. Jones, K. Sparck Jones, and S. J. Young, “Open-vocabulary speech indexing for voice and video mail retrieval,” in Proc. ACM Multimedia 96, Boston, Nov. 1996.
David Anthony James, “The Application of Classical Information Retrieval Techniques to Spoken Documents,” Ph.D. thesis, University of Cambridge, Downing College, 1995.
Ciprian Chelba and Alex Acero, “Position specific posterior lattices for indexing speech,” in Proceedings of ACL, Ann Arbor, Michigan, Jun. 2005.
Sergey Brin and Lawrence Page, “The anatomy of a large-scale hypertextual web search engine,” Computer Networks and ISDN Systems, vol. 30, No. 1-7, pp. 107-117, 1998.
L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” in Proceedings IEEE, 1989, vol. 77(2), pp. 257-285.
James Glass, T. J. Hazen, Lee Hetherington, and Chao Wang, “Analysis and processing of lecture audio data: Preliminary investigations,” in HLT-NAACL 2004 Workshop: Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, Massachusetts, May 2004, pp. 9-12.
Aubert, X. L. “Fast Look-ahead Pruning Strategies in Continuous Speech Recognition.” Proc. ICASSP-89. 1989. 659-662.
Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117.
M.G. Brown, J.T. Foote, G.J.F. Jones, K. Spärck Jones, and S.J. Young. 1996. Open-vocabulary speech indexing for voice and video mail retrieval. In Proc. ACM Multimedia 96, pp. 307-316, Boston, November.
Kenneth Ward Church. 2003. Speech and language processing: Where have we been and where are we going? In Proceedings of Eurospeech, Geneva, Switzerland.
J. Garofolo, G. Auzanne, and E. Voorhees. 2000. The TREC spoken document retrieval track: A success story. In Proceedings of the Recherche d'Informations Assiste par Ordinateur: Content-Based Multimedia Information Access Conference, April.
James Glass, Timothy J. Hazen, Lee Hetherington, and Chao Wang. 2004. Analysis and processing of lecture audio data: Preliminary investigations. In HLT-NAACL 2004 Workshop: Interdisciplinary Approaches to Speech Indexing and Retrieval, pp. 9-12, Boston, Massachusetts, USA, May 6.
David Anthony James. 1995. The Application of Classical Information Retrieval Techniques to Spoken Documents. Ph.D. thesis, University of Cambridge, Downing College.
B. Logan, P. Moreno, and O. Deshmukh. 2002. Word and sub-word indexing approaches for reducing the effects of OOV queries on spoken audio. In Proc. HLT.
Kenney Ng. 2000. Subword-Based Approaches for Spoken Document Retrieval. Ph.D. thesis, Massachusetts Institute of Technology.
L.R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. In Proceedings IEEE, vol. 77(2), pp. 257-285.
Murat Saraclar and Richard Sproat. 2004. Lattice-based search for spoken utterance retrieval. In HLT-NAACL 2004: Main Proceedings, pp. 129-136, Boston, Massachusetts, USA, May 2-May 7.
F. Seide and P. Yu. 2004a. A hybrid word/phoneme-based approach for improved vocabulary-independent search in spontaneous speech. In Proceedings of ICSLP, Jeju, Korea.
F. Seide and P. Yu. 2004b. Vocabulary-independent search in spontaneous speech. In Proceedings of ICASSP, Montreal, Canada.
Matthew A. Siegler. 1999. Integration of Continuous Speech Recognition and Information Retrieval for Mutually Optimal Performance. Ph.D. thesis, Carnegie Mellon University.
P.C. Woodland, S.E. Johnson, P. Jourlin and K. Spärck Jones. 2000. Effects of out of vocabulary words in spoken document retrieval. In Proceedings of SIGIR, pp. 372-374, Athens, Greece.
Ljolje, A., Pereira, F. & Riley, M. (1999). Efficient General Lattice Generation and Rescoring. In Proceedings of the 6th European Conference on Speech Communications and Technology, vol. 3 pp. 1251-1254, Budapest.
Chelba et al., C. “Speech OGLE: Indexing Uncertainty for Spoken Document Search”, Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 41-44, Ann Arbor, Jun. 2005.
Mangu et al., L., “Finding consensus in speech recognition: word error minimization and other applications of confusion networks”, Computer Speech and Language vol. 14, No. 4, Oct. 7, 2000.
MSN Search, “Index Serving Core”, design specification, 2004.
International Search Report and Written Opinion of the International Searching Authority for Application No. PCT/US2006/042723 filed Oct. 31, 2006. Date of Mailing: Mar. 30, 2007.
Douglas Oard, Bhuvana Ramabhadran, and Samuel Gustman (2004). Building an Information Retrieval Test Collection for Spontaneous Conversational Speech. In Proceedings of SIGIR 2004.
J. P. A. Charles and P. N. Garner, “Spoken content metadata and MPEG-7,” in Proc. ACM MM2000 Workshops, 2000, pp. 81-84.
J. V. Thong, P. J. Moreno, B. Logan, B. Fidler, K. Maffey, and M. Moores, SPEECHBOT: An Experimental Speech-Based Search Engine for Multimedia Content in the Web: Compaq Cambridge Res. Lab. Tech. Rep., CRL Jun. 2001.
Dharanipragada, S., and Roukos, S. A fast vocabulary independent algorithm for spotting words in speech. In Proceedings of ICASSP 98, 1998.
Huang, X., Acero, A., Alleva, F., Hwang, M., Jiang, L. and Mahajan, M. Microsoft Windows Highly Intelligent Speech Recognizer: Whisper. In IEEE International Conference on Acoustics, Speech, and Signal Processing, May 1995, vol. 1, pp. 93-96.
