Reexamination Certificate
2005-11-08
2010-10-05
Dorvil, Richemond (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S007000, C707S711000
active
07809568
ABSTRACT:
An index for searching spoken documents having speech data and text meta-data is created by obtaining probabilities of occurrence of words and positional information of the words of the speech data and combining them with at least positional information of the words in the text meta-data. A single index can be created because the speech data and the text meta-data are treated the same way and regarded simply as different categories.
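The idea in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Posting` record, the category labels ("title", "speech"), the sample words and all probabilities are assumptions made up for illustration. Speech is represented as a list of positions, each holding candidate words with a probability of occurrence. Text meta-data is represented the same way, with one candidate of probability 1.0 per position, so a single inverted index can hold both.

```python
from collections import defaultdict
from typing import NamedTuple


class Posting(NamedTuple):
    """One index entry (hypothetical layout, not taken from the patent)."""
    doc_id: str
    category: str       # e.g. "speech" or "title"; labels are illustrative
    position: int       # word position within that category
    probability: float  # probability that the word occurs at that position


def text_to_positions(text):
    """Represent plain text like recognizer output: one certain word per position."""
    return [[(word, 1.0)] for word in text.lower().split()]


def build_index(documents):
    """Build a single inverted index over every category of every document.

    `documents` maps doc_id -> {category: positions}, where `positions` is a
    list whose i-th element is a list of (word, probability) candidates.
    """
    index = defaultdict(list)
    for doc_id, categories in documents.items():
        for category, positions in categories.items():
            for position, candidates in enumerate(positions):
                for word, prob in candidates:
                    index[word].append(Posting(doc_id, category, position, prob))
    return index


# Made-up example data: one spoken document with a text title.
documents = {
    "doc1": {
        "title": text_to_positions("Budget meeting"),
        "speech": [
            [("the", 0.9), ("a", 0.1)],
            [("budget", 0.7), ("budge", 0.3)],
            [("meeting", 0.8), ("meting", 0.2)],
        ],
    },
}

index = build_index(documents)
for posting in index["budget"]:
    print(posting)
```

Looking up "budget" returns one posting from the title with probability 1.0 and one from the speech with the recognizer's probability, both from the same index. A ranking function could then weight the categories differently, but that step is outside this sketch.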
REFERENCES:
patent: 4783803 (1988-11-01), Baker et al.
patent: 4977598 (1990-12-01), Doddington et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5241619 (1993-08-01), Schwartz et al.
patent: 5745899 (1998-04-01), Burrows
patent: 5799276 (1998-08-01), Komissarchik et al.
patent: 5963940 (1999-10-01), Liddy et al.
patent: 6006221 (1999-12-01), Liddy et al.
patent: 6047283 (2000-04-01), Braun
patent: 6169972 (2001-01-01), Kono et al.
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6266658 (2001-07-01), Adya et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6374220 (2002-04-01), Kao
patent: 6397181 (2002-05-01), Li et al.
patent: 6421645 (2002-07-01), Beigi
patent: 6424946 (2002-07-01), Tritschler
patent: 6584458 (2003-06-01), Millett et al.
patent: 6611803 (2003-08-01), Furuyama et al.
patent: 6678689 (2004-01-01), Yoon
patent: 6760702 (2004-07-01), Chien
patent: 6873993 (2005-03-01), Charlesworth et al.
patent: 6877134 (2005-04-01), Fuller et al.
patent: 6907397 (2005-06-01), Kryze et al.
patent: 7401019 (2005-07-01), Seide et al.
patent: 7089188 (2006-08-01), Logan et al.
patent: 7092883 (2006-08-01), Gretter et al.
patent: 7216077 (2007-05-01), Padmanabhan et al.
patent: 7266553 (2007-09-01), Anderson et al.
patent: 7313554 (2007-12-01), Chen et al.
patent: 7379870 (2008-05-01), Belvin et al.
patent: 2002/0022960 (2002-02-01), Charlesworth et al.
patent: 2002/0111792 (2002-08-01), Cherny
patent: 2002/0184196 (2002-12-01), Lehmeier et al.
patent: 2003/0055634 (2003-03-01), Hidaka et al.
patent: 2003/0088397 (2003-05-01), Karas et al.
patent: 2003/0177108 (2003-09-01), Charlesworth
patent: 2003/0187643 (2003-10-01), Van Thong et al.
patent: 2003/0187649 (2003-10-01), Logan et al.
patent: 2003/0200091 (2003-10-01), Furuyama et al.
patent: 2003/0204399 (2003-10-01), Wolf et al.
patent: 2004/0044952 (2004-03-01), Jiang et al.
patent: 2004/0162730 (2004-08-01), Mahajan et al.
patent: 2004/0199385 (2004-10-01), Deligne et al.
patent: 2005/0010412 (2005-01-01), Aronowitz
patent: 2005/0060139 (2005-03-01), Corston-Oliver et al.
patent: 2005/0080631 (2005-04-01), Abe et al.
patent: 2005/0096908 (2005-05-01), Bacchiani et al.
patent: 2005/0119885 (2005-06-01), Axelrod et al.
patent: 2005/0228671 (2005-10-01), Olorenshaw et al.
patent: 2006/0074895 (2006-04-01), Belknap
patent: 2006/0212294 (2006-09-01), Gorin et al.
patent: 2007/0005574 (2007-01-01), Crispo et al.
patent: 2007/0106509 (2007-05-01), Acero et al.
patent: 2007/0106512 (2007-05-01), Acero et al.
patent: 2007/0143110 (2007-06-01), Acero et al.
patent: 1 043 665 (2000-10-01), None
patent: 01113371 (2001-07-01), None
patent: WO 00/54168 (2000-09-01), None
patent: WO 02/27546 (2002-04-01), None
U. Glavitsch, P. Schäuble, and M. Wechsler, “Metadata for integrating speech documents in a text retrieval system,” ACM SIGMOD Rec., vol. 23, No. 4, pp. 57-63, 1994.
Bulyko, I., Ostendorf, M., Stolcke, A.: Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures. In Hearst, M., Ostendorf, M., eds.: Proc. HLT-NAACL. vol. 2., Edmonton, Alberta, Canada, Association for Computational Linguistics (2003) 7-9.
Douglas Oard, Bhuvana Ramabhadran, and Samuel Gustman (2004). Building an Information Retrieval Test Collection for Spontaneous Conversational Speech. In Proceedings of SIGIR 2004.
J. P. A. Charlesworth and P. N. Garner, “Spoken content metadata and MPEG-7,” in Proc. ACM MM2000 Workshops, 2000, pp. 81-84.
N. Moreau, H. G. Kim, and T. Sikora. Phone-based spoken document retrieval in conformance with the mpeg-7 standard. Proc. of the Audio Engineering Society 25th Intl. Conf., 2004.
A. T Lindsay, S. Srinivasan, J. P. A. Charlesworth, P. N. Garner, and W. Kriechbaum, “Representation and linking mechanisms for audio in MPEG-7,” Signal Processing: Image Commun., vol. 16, pp. 193-209, 2000.
Moreau N., Kim H.-G., Sikora T., “Combination of Phone N-Grams for a MPEG-7-based Spoken Document Retrieval System”, to be published in EUSIPCO 2004.
Charlesworth J. P. A. & Garner P. N., “SpokenContent Representation in MPEG-7”, IEEE Trans. on Circuits and Systems for Video Technology, vol. 11, No. 6, pp. 730-736, Jun. 2001.
J. T Foote, S. J. Young, G. J. F Jones, and K. Sparck Jones. 1997. Unconstrained keyword spotting using phone lattices with application to spoken document retrieval. Computer Speech and Language, 11(2):207-224.
Yue-Shi Lee and Hsin-Hsi Chen. “A Multimedia Retrieval System for Retrieving Chinese Text and Speech Documents” 1999.
D. A. James. The Application of Classical Information Retrieval Techniques to Spoken Documents. PhD thesis, Cambridge University, Downing College, Feb. 1995.
Lidia Mangu, Eric Brill, Andreas Stolcke, “Finding Consensus Among Words: Lattice-Based Word Error Minimization” Sep. 1999.
Yang Liu, Mary P. Harper, Michael T. Johnson, Leah H. Jamieson, “The Effect of Pruning and Compression on Graphical Representations of the Output of a Speech Recognizer” Feb. 14, 2002.
Hillard et al. “Improving Automatic Sentence Boundary Detection with Confusion Networks” 2004.
Peter S. Cardillo, Mark Clements and Michael S. Miller. “Phonetic Searching vs. LVCSR: How to Find What You Really Want in Audio Archives” 2002.
Begeja et al. “A System for Searching and Browsing Spoken Communications” 2004.
Ulrike Glavitsch, Peter Schäuble, Martin Wechsler. “Metadata for Integrating Speech Documents in a Text Retrieval System” 1994.
Alexandre Ferrieux and Stephane Peillon. “Phoneme-Level Indexing for Fast and Vocabulary-Independent Voice/Voice Retrieval” 1999.
Allauzen et al. “Open Vocabulary ASR for Audiovisual Document Indexation” ICASSP 2005.
Yue-Shi Lee and Hsin-Hsi Chen. “Metadata for Integrating Chinese Text and Speech Documents in a Multimedia Retrieval System” 1997.
Cyril Allauzen and Mehryar Mohri and Murat Saraclar. “General Indexation of Weighted Automata—Application to Spoken Utterance Retrieval” 2004.
Lidia Mangu and Eric Brill. “Lattice Compression in the Consensual Post-Processing Framework” 1999.
Method and Apparatus for Indexing Speech, filed May 20, 2005, U.S. Appl. No. 11/133,515, pp. 1-33 and 7 sheets of drawings.
Kenneth Ward Church, “Speech and language processing: Where have we been and where are we going?,” in Proceedings of Eurospeech, Geneva, Switzerland, 2003.
M. G. Brown, J. T. Foote, G. J. F. Jones, K. Spärck Jones, and S. J. Young, “Open-vocabulary speech indexing for voice and video mail retrieval,” in Proc. ACM Multimedia 96, Boston, Nov. 1996, pp. 307-316.
David Anthony James, The Application of Classical Information Retrieval Techniques to Spoken Documents, Ph.D. thesis, University of Cambridge, Downing College, 1995.
Ciprian Chelba and Alex Acero, “Position specific posterior lattices for indexing speech,” in Proceedings of ACL, Ann Arbor, Michigan, Jun. 2005.
Sergey Brin and Lawrence Page, “The anatomy of a large-scale hypertextual Web search engine,” Computer Networks and ISDN Systems, vol. 30, No. 1-7, pp. 107-117, 1998.
Chelba, C. et al., “Speech OGLE: Indexing Uncertainty for Spoken Document Search”, Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 41-44, Ann Arbor, Jun. 2005.
MSN Search, “Index Serving Core”, design specification, 2004.
Acero Alejandro
Chelba Ciprian I.
Sanchez Jorge F. Silva
Borsetti Greg A
Dorvil Richemond
Koehler Steven M.
Microsoft Corporation
Westman Champlin & Kelly P.A.