Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2011-03-08
2011-03-08
Dorvil, Richemond (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S231000, C704S251000, C704S243000
Reexamination Certificate
active
07904296
ABSTRACT:
An approach to wordspotting (180) using query data from one or more spoken instance of a query (140). The query data is processed to determining a representation of the query (160) that defines multiple sequences of subword (130) units each representing the query. Then putative instances of the query (190) are located in input data from an audio signal using the determined representation of the query.
REFERENCES:
patent: 5165007 (1992-11-01), Bahl et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5425129 (1995-06-01), Garman et al.
patent: 5509104 (1996-04-01), Lee et al.
patent: 5625748 (1997-04-01), McDonough et al.
patent: 5649057 (1997-07-01), Lee et al.
patent: 5748840 (1998-05-01), La Rue
patent: 5794194 (1998-08-01), Takebayashi et al.
patent: 5797123 (1998-08-01), Chou et al.
patent: 5895464 (1999-04-01), Bhandari et al.
patent: 5918222 (1999-06-01), Fukui et al.
patent: 6061652 (2000-05-01), Tsuboka et al.
patent: 6073095 (2000-06-01), Dharanipragada et al.
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6317710 (2001-11-01), Huang et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6434520 (2002-08-01), Kanevsky et al.
patent: 6873993 (2005-03-01), Charlesworth et al.
patent: 6985861 (2006-01-01), Thong et al.
patent: 7212968 (2007-05-01), Garner et al.
patent: 7542966 (2009-06-01), Wolf et al.
patent: 7590605 (2009-09-01), Josifovski
patent: 7747611 (2010-06-01), Milic-Frayling et al.
patent: 2002/0013706 (2002-01-01), Profio
patent: 2002/0052740 (2002-05-01), Charlesworth et al.
patent: 2002/0052870 (2002-05-01), Charlesworth et al.
patent: 2002/0120447 (2002-08-01), Charlesworth et al.
patent: 2003/0110035 (2003-06-01), Thong et al.
patent: 2003/0187643 (2003-10-01), Van Thong et al.
patent: 2003/0204399 (2003-10-01), Wolf et al.
patent: 2003/0204492 (2003-10-01), Wolf et al.
patent: 2004/0083099 (2004-04-01), Scarano et al.
patent: 2005/0010412 (2005-01-01), Aronowitz
patent: 2007/0038450 (2007-02-01), Josifovski
patent: 0 800 158 (1997-10-01), None
R. Rose and D. Paul. A hidden Markov model based keyword recognition system. In Proc. I W P. pp. 129-132, NM, Apr. 1990.
T. Kawahara, C.-H. Lee, and B.-H. Juang. 1998. Flexible speech understanding based on combined key-phrase detection and verification. IEEE Trans. on Speech and Audio Processing, 6(6):558-568.
Ng K, Zue V (1997) Subword unit representations for spoken document retrieval. In: Proc. Eurospeech 97. ESCA.
Jonathan Foote, An overview of audio information retrieval, Multimedia Systems, v.7 n. 1, p. 2-10, Jan. 1999.
Sheridan, M. Wechsler and P. Schauble, Cross-language speech retrieval: establishing a baseline performance. In: Proc. 20th Ann. Internat. ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-97) (1997), pp. 99-107.
Foote JT, Jones GJF, Sparck Jones K, Young SJ (1997) Unconstrained keyword spotting using phone lattices. Comput Speech Lang, vol. 11, p. 207-224, 1997.
Helen M. Meng, Wai-Kit Lo, Berlin Chen, and Karen Tang. 2001. Generating Phonetic Cognates to Handle Named Entities in English-Chinese Cross-Language Spoken Document Retrieval. Proceedings of ASRU.
James DA, Young SJ (1994) A fast lattice-based approach to vocabulary independent wordspotting. In: Proc. ICASSP 94, vol. 1, Adelaide, Australia. IEEE CS Press, Piscataway, N.J., pp. 377-380.
Clements, M., Cardillo, P., and Miller,M. (2001). Phonetic searching of digital audio. Broadcast Engineering Conference Proceedings. Washington: National Association of Broadcasters, pp. 131-140.
H. Bekker et al. “Inventory of Metadata for Multimedia”, Sep. 2000.
Phonetic Searching vs. LVCSR: How to Find What You Really Want in Audio Archives pp. 9-22(14) Authors: Cardillo P.S.; Clements M.; Miller M.S. Jan. 2002.
Yuk-Chi Li, Wai-Kit Lo, Helen M. Meng and P. C. Ching. 2000 Query expansion using phonetic confusions for Chinese spoken document retrieval. IRAL: Proceedings of the fifth international workshop on on Information retrieval with Asian languages.
Quackenbush et al. “Overview of MPEG-7 AudioOverview of MPEG-7 Audio” 2001.
Jones et al. “Retrieving Spoken Documents by Combining Multiple Index Sources” 1996.
Logan et al. “Confusion-Based Query Expansion for Oovwords in Spoken Document Retrieval” 2002.
Ng et al. “Experiments in spoken document retrieval using phoneme n-grams” 2000.
Amir et al. “Advances in Phonetic Word Spotting” 2001.
Ng et al. “Phonetic Recognition for Spoken Documents Retrieval” 1998.
Srinivasan et al. “Phonetic Confusion Matrix Based Spoken Document Retrieval” 2000.
Stuker. “Automatic Generation of Pronunciation Dictionaries” 2002.
Abberley, et al. “Retrieval of Broadcast News Documents With the Thisl System,” Proc. IEEE ICASSP'98, pp. 3781-3784 (1998).
Brown, et al. “Open Vocabulary Speech Indexing for Voice and Video Mail Retreival,” Proc. of ACM Multimedia pp. 307-316 (1996).
Deshmukh, et al. “Automated Generation of N-Best Pronunciations of Proper Nouns,” Proc. IEEE ICASSP'96 pp. 283-286 (1996).
Makhoul et al., “Speech and Language Technologies for Audio Indexing and Retrieval,” Proc. IEEE, 88(8), pp. 13381353 (2000).
International Search Report, Patent Cooperation Treaty, Oct. 8, 2004, 11 pages.
Preliminary Examination Report, Patent Cooperation Treaty, Feb. 2, 2006, 7 pages.
James D. A. et al., “A fast lattice-based approach to vocabulary independent wordspotting,” Acoustics, Speech, and Signal Processing, 1994. ICASSP-94, 1994 IEEE International Conference on Adelaide, SA, Australia Apr. 19-22, 1994, New York, NY, USA, IEEE, vol. i, pp. I-377, XP010133516.
Supplementary European Search Report for the Application No. 04757219.3-2225 dated Dec. 11, 2007, 4 pages.
Cooper et al.; “Building Searchable Collections of Enterprise Speech Data”; JCDL '01, Jun. 24-28, 2001; Roanoke, VA.
Choi et al.; “SCAN—SpeechContent Based Audio Navigator: A Systems Overview”; Proceedings of International Conference on Spoken Language Processing; Jan. 15, 1998.
Borsetti Greg A
Dorvil Richemond
Nexidia Inc.
Occhiuti Rohlicek & Tsao LLP
LandOfFree
Spoken word spotting queries does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Spoken word spotting queries, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Spoken word spotting queries will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2740827