Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2005-05-20
2009-12-15
Abebe, Daniel D (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S254000, C704S275000, C369S025010, C369S027010
Reexamination Certificate
active
07634407
ABSTRACT:
A method of indexing a speech segment includes identifying at least two alternative word sequences based on the speech segment. For each word in the alternative sequences, information is placed in an entry for the word in the index. The information indicates the position of the word in at least one of the alternative sequences.
REFERENCES:
patent: 4783803 (1988-11-01), Baker et al.
patent: 4977598 (1990-12-01), Doddington et al.
patent: 5745899 (1998-04-01), Burrows
patent: 6047283 (2000-04-01), Braun
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6266658 (2001-07-01), Adya et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6374220 (2002-04-01), Kao
patent: 6584458 (2003-06-01), Millett et al.
patent: 6611803 (2003-08-01), Furuyama et al.
patent: 6873993 (2005-03-01), Charlesworth et al.
patent: 6907397 (2005-06-01), Kryze et al.
patent: 7092883 (2006-08-01), Gretter et al.
patent: 7216077 (2007-05-01), Padmanabhan et al.
patent: 7266553 (2007-09-01), Anderson et al.
patent: 7313554 (2007-12-01), Chen et al.
patent: 7379870 (2008-05-01), Belvin et al.
patent: 2002/0052870 (2002-05-01), Charlesworth et al.
patent: 2002/0184196 (2002-12-01), Lehmeier et al.
patent: 2003/0055634 (2003-03-01), Hidaka et al.
patent: 2003/0187643 (2003-10-01), Van Thong et al.
patent: 2003/0187649 (2003-10-01), Logan et al.
patent: 2004/0044952 (2004-03-01), Jiang et al.
patent: 2004/0199385 (2004-10-01), Deligne et al.
patent: 2005/0060139 (2005-03-01), Corston-Oliver et al.
patent: 2005/0159953 (2005-07-01), Seide et al.
patent: 2005/0228671 (2005-10-01), Olorenshaw et al.
patent: 2007/0106509 (2007-05-01), Acero et al.
patent: 2007/0106512 (2007-05-01), Acero et al.
patent: 2007/0143110 (2007-06-01), Acero et al.
patent: 1 043 665 (2000-10-01), None
patent: WO 00/54168 (2000-09-01), None
patent: WO 02/27546 (2002-04-01), None
Kenneth Ward Church, “Speech and Language Processing: Where have we been and where are we going?,” inProceedings of Eurospeech, Geneva, Switzerland, 2003.
J. Garofolo, G. Auzanne, and E. Voorhees, “The TREC spoken document retrieval track: A success story,” inProceedings of the Recherche d'Informations Assiste par Ordinateur: ContentBased Multimedia Information Access Conference, Apr. 2000.
M. G. Brown, J. T. Foote, G. J. F. Jones, K. Sparck Jones, and S. J. Young, “Open-vocabulary speech indexing for voice and video mail retrieval,” inProc. ACM Multimedia 96, Boston, Nov. 1996.
David Anthony James, “The Application of Classical Information Retrieval Techniques to Spoken Documents,” Ph.D. thesis, University of Cambridge, Downing College, 1995.
Ciprian Chelba and Alex Acero, “Position specific posterior lattices for indexing speech,” inProceedings of ACL, Ann Arbor, Michigan, Jun. 2005.
Sergey Brin and Lawrence Page, “The anatomy of a large-scale hypertextual web search engine,”Computer Networks and ISDN Systems, vol. 30, No. 1-7, pp. 107-117, 1998.
L. R. Rabiner, “A tutorial on hidden markov models and selected applications in speech recognition,” inProceedings IEEE, 1989, vol. 77(2), pp. 257-285.
James Glass, T. J. Hazen, Lee Hetherington, and Chao Wang, “Analysis and processing of lecture audio data: Preliminary investigations,” inHLT-NAACL 2004 Workshop: Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, Massachusetts, May 2004, pp. 9-12.
Aubert, X. L. “Fast Look-ahead Pruning Strategies in Continuous Speech Recognition.” Proc. ICASSP-89. 1989. 659-662.
Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine.Computer Networks and ISDN Systems, 30(1-7):107-117.
M.G. Brown, J.T. Foote, G.J.F. Jones, K. Spärck Jones, and S.J. Young. 1996. Open-vocabulary speech indexing for voice and video mail retrieval. InProc. ACM Multimedia 96, pp. 307-316, Boston, November.
Kenneth Ward Church. 2003. Speech and language processing: Where have we been and where are we going? InProceedings of Eurospeech, Geneva, Switzerland.
J. Garofolo, G. Auzanne, and E. Vorrhees. 2000. The TREC spoken document retrieval track: A success story. InProceedings of the Recherche d'Informations Assiste par Ordinateur: ContentBased Multimedia Information Access Conference, April.
James Glass, Timothy J. Hazen, Lee Hetherington, and Chao Wang. 2004. Analysis and processing of lecture audio data: Preliminary investigations. InHLT-NAACL 2004 Workshop: Interdisciplinary Approaches to Speech Indexing and Retrieval, pp. 9-12, Boston, Massachusetts, USA, May 6.
David Anthony James. 1995.The Application of Classical Information Retrieval Techniques to Spoken Documents. Ph.D. thesis, University of Cambridge, Downing College.
B. Logan, P. Moreno, and O. Deshmukh. 2002. Word and sub-word indexing approaches for reducing the effects of OOV queries on spoken audio. InProc. HLT.
Kenney Ng. 2000.Subword-Based Approaches for Spoken Document Retrieval. Ph.D. thesis, Massachusetts Institute of Technology.
L.R. Rabiner. 1989. A tutuorial on hidden markov models and selected applications in speech recognition. InProceedings IEEE, vol. 77(2), pp. 257-285.
Murat Saraclar and Richard Sproat. 2004. Lattic-based search for spoken utterance retrieval. InHLT-NAACL 2004: Main Proceedings, pp. 129-136, Boston, Massachusetts, USA, May 2-May 7.
F. Seide and P. Yu. 2004a. A hybrid word/phonemebased approach for improved vocabulary-independent search in spontaneous speech. InProceedings of IC-SLP, Jeju, Korea.
F. Seide and P. Yu. 2004b. Vocabulary-independent search in spontaneous speech. InProceedings of ICASSP, Montreal, Canada.
Matthew A. Siegler. 1999.Integration of Continuous Speech Recognition and Information Retrieval for Mutually Optimal Performance. Ph.D. thesis, Carnegie Mellon University.
P.C. Woodland, S.E. Johnson, P. Jourlin and K. Spärck Jones. 2000. Effects of out of vocabulary words in spoken document retrieval. InProceedings of SIGIR, pp. 372-374, Athens, Greece.
Ljolje, A., Pereira, F. & Riley, M. (1999). Efficient General Lattice Generation and Rescoring. In Proceedings of the 6thEuropean Conference on Speech Communications and Technology, vol. 3 pp. 1251-1254, Budapest.
Chelba et al., C. “Speech OGLE: Indexing Uncertainty for Spoken Document Search”, Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 41-44, Ann Arbor, Jun. 2005.
Mangu et al., L., “Finding consensus in speech recognition: word error minimization and other applications of confusion networks”, Computer Speech and Language vol. 14, No. 4, Oct. 7, 2000.
MSN Search, “Index Serving Core”, design specification, 2004.
International Search Report and Written Opinion of the International Searching Authority for Application No. PCT/US2006/042723 filed Oct. 31, 2006. Date of Mailing: Mar. 30, 2007.
Douglas Oard, Bhuvana Ramabhadran, and Samuel Gustman (2004). Building an Information Retrieval Test Collection for Spontaneous Conversational Speech. In Proceedings of SIGIR 2004.
J. P. A. Charles and P. N. Garner, “Spoken content metadata and MPEG-7,” in Proc. ACM MM2000 Workshops, 2000, pp. 81-84.
J. V. Thong, P. J. Moreno, B. Logan, B. Fidler, K. Maffey, and M. Moores, SPEECHBOT: An Experimental Speech-Based Search Engine for Multimedia Content in the Web: Compaq Cambridge Res. Lab. Tech. Rep., CRL Jun. 2001.
Dharanipragada, S., and Roukos, S. A Fast vocabulary independent algorithm for spoiling words in speech. In Proceedings of ICASSP 98, 1998.
Huang, X., Acero, A. Alleva, F., Hwang, M., Jiang, L. and Mahajan, M. Microsoft Windows Highly Intelligent Speech Recognizer: Whisper. In IEEE International Conference on Acoustics, Speech, and Signal Processing, May 1995, vol. 1, pp. 93-96.
Acero Alejandro
Chelba Ciprian I.
Abebe Daniel D
Magee Theodore M.
Microsoft Corporation
Westman Champlin & Kelly P.A.
LandOfFree
Method and apparatus for indexing speech does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for indexing speech, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for indexing speech will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4096288