Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-12-25
2007-12-25
Harper, V. Paul (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S007000, C707S793000, C707S793000
Reexamination Certificate
active
11609138
ABSTRACT:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
REFERENCES:
patent: 4481593 (1984-11-01), Bahler
patent: 4802231 (1989-01-01), Davis
patent: 4896358 (1990-01-01), Bahler et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5218668 (1993-06-01), Higgins et al.
patent: 5406423 (1995-04-01), Sato
patent: 5440662 (1995-08-01), Sukkar
patent: 5454062 (1995-09-01), La Rue
patent: 5509104 (1996-04-01), Lee et al.
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5557789 (1996-09-01), Mase et al.
patent: 5621849 (1997-04-01), Sakurai et al.
patent: 5649057 (1997-07-01), Lee et al.
patent: 5701452 (1997-12-01), Siefert
patent: 5732394 (1998-03-01), Nakadai et al.
patent: 5748840 (1998-05-01), La Rue
patent: 5787414 (1998-07-01), Miike et al.
patent: 5794194 (1998-08-01), Takebayashi et al.
patent: 5797123 (1998-08-01), Chou et al.
patent: 5822405 (1998-10-01), Astarabadi
patent: 5822409 (1998-10-01), Chang et al.
patent: 5822729 (1998-10-01), Glass
patent: 5826260 (1998-10-01), Byrd et al.
patent: 5832430 (1998-11-01), Lleida et al.
patent: 5842163 (1998-11-01), Weintraub
patent: 5884262 (1999-03-01), Wise et al.
patent: 5895464 (1999-04-01), Bhandari et al.
patent: 5909662 (1999-06-01), Yamazaki et al.
patent: 5918222 (1999-06-01), Fukui et al.
patent: 5918223 (1999-06-01), Blum et al.
patent: 5950159 (1999-09-01), Knill
patent: 5987457 (1999-11-01), Ballard
patent: 6023659 (2000-02-01), Seilhamer et al.
patent: 6023677 (2000-02-01), Class et al.
patent: 6023726 (2000-02-01), Saksena
patent: 6073095 (2000-06-01), Dharanipragada et al.
patent: 6169986 (2001-01-01), Bowman et al.
patent: 6185527 (2001-02-01), Petovic et al.
patent: 6260011 (2001-07-01), Heckerman et al.
patent: 6317710 (2001-11-01), Huang et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6363377 (2002-03-01), Kravets et al.
patent: 6434520 (2002-08-01), Kanevsky et al.
patent: 6873993 (2005-03-01), Charlesworth et al.
patent: 6985861 (2006-01-01), Van Thong et al.
patent: 7113910 (2006-09-01), Pereira et al.
patent: 2002/0052870 (2002-05-01), Charlesworth et al.
patent: 2003/0110035 (2003-06-01), Thong et al.
patent: 2004/0083099 (2004-04-01), Scarano et al.
patent: 2005/0216443 (2005-09-01), Morton et al.
patent: 0177854 (1989-06-01), None
patent: 0398574 (1990-11-01), None
Abberley, et al. “Retrieval of Broadcast News Documents With The Thisl System,”Proc. IEEE ICASSP'98, pp. 3781-3784 (1998).
Abberley, et al. “The Thisl Broadcast News Retrieval System,”ESCA ETRW Workshop on Accessing Information in Spoken Audio(1999).
Abberley, et al. “The THISL SDR system at TREC-8,”In Proceedings of the 8th Text Retrieval Conference (TREC-8). (1999).
Alvarez-Cercadillo, et al. “Context Modeling Using RNN For Keyword Protection,”Proc. IEEE ICASSP'93vol. I: pp. 569-572 (1993).
Bahl, et al. “Maximum Mutual Information Estimation of Hidden Markov Model Parameters for Speech Recognition,”Proc. IEEE ICASSP'86, vol. I: pp. 49-52 (1986).
Bakis “Spoken Word Spotting via Centisecond Acoustic States,”IBM Technical Disclosure Bulletin18(10) (1976).
Boreczky et al. “A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features,”Proc. ICASSP'98(1998).
Bourlard, et al. “Optimizing Recognition and Rejection Performance in Wordspotting Systems,”Proc. IEEE ICASSP'94vol. 1: pp. 373-376.
Brown, et al. “Open Vocabulary Speech Indexing For Voice and Video Mail Retreival,”Proc. of ACM Multimediapp. 307-316 (1996).
Chang, et al. “High-Performance Low-Complexity Wordspotting Using Neural Networks,”IEEE Trans. Signal Processing, vol. 45 No. 11 pp 2864-2870 (1997).
Chang, et al. “Improving Wordspotting Performance With Artificially Generated Data,”Proc. IEEE ICASSP'96vol. 1: pp. 526-529 (1996).
Choi et al. “SCAN—Speech Content based Audio Navigator: A systems overview,”Proc. ICSLP'98(1998).
Choi, et al. “An Overview of the AT&T Spoken Document Retrieval,”DARPA/NIST Broadcast News Transcription and Understanding Workshop(1998).
Cooper, et al. “Building Searchable Collections of Enterprise Speech Data.”
Deshmukh, et al. “Automated Generation of N-Best Pronunciations of Proper Nouns,”Proc. IEEE ICASSP'96pp. 283-286 (1996).
deVries “Radio and Television Information Filtering through Speech Recognition.”
Dharanipragada, et al “A Fast Vocabulary Independent Algorithm For Spotting Words in Speech,”Proc. IEEE ICASSP'98, pp. 233-236 (1998).
Dharanipragada, et al. “Audio-Indexing For Broadcasting News,”Proceedings of TREC6(1977).
Dharanipragada, et al. “New Word Detection in Audio-Indexing,”Proc 1997 Workshop on Automatic Speech Recognition and Understanding, pp. 551-557 (1997).
Dharanipragada, et al. “Experimental Results in Audio Indexing,”Proceedings of TREC6(1997).
Foote, et al. “An Overview of Audio Information Retrieval,”ACM-Springer Multimedia Systems(1998).
Garofolo et al., “The TREC Spoken Document Retrieval Track: A Success Story,”Proc. TREC-8, pp. (2000).
Gelin, et al. “Keyword Spotting for Video Soundtrack Indexing,”Proc. IEEE ICASSP'96vol. l: pp. 299-302 (1996).
Higgins, et al. “Keyword Recognition Using templace Concatenation,”Proc. IEEE ICASSP'85, vol. III: pp. 1233-1236 (1985).
Hirschberg, et al. “Finding Information In Audio: A New Paradigm For Audio Browsing and Retrieval,”In Proceeding of the ESCA ETRW Workshop(1999).
Hofstetter, et al. “Techniques for Task Independent Word Spotting In Continuous Speech Messages,”Proc. IEEE ICASSP'92, vol. II: pp. 101-104 (1992).
Huang et al. “A Fast Algorithm for Large Vocabulary Keyword Spotting Application,”Proc. IEEE Trans, On Speech and Audio Proc., 2(3) (1994).
Itoh, et al. “Sentence Spotting Applied to Partieal Sentences and Unknown Words,” Proc. IEEE ICASSP'94 pp. I-369-372 (1994).
James “A System For Unrestricted Topic Retrieval From Radio News Broadcasts,”Proc. IEEE ICASSP'96pp. 279-282 (1996).
James, et al. “A Fast Lattice-Based Approach to Vocabulary Independent Wordspotting,”Proc. ICASSPvol. 1: pp. 377-380 (1994).
Jeanrenaud et al. “Phonetic-based Word Spotter: Various Configurations and Applications to Event Spotting.” Proc. Eurospeech'93, vol. II, pp. 1057-1060 (1993).
Jeanrenaud, et al. “Spotting Events in Continuous Speech,”Proc. IEEE ICASSP'94, vol. I: pp. 381-384 (1994).
Johnson et al. “The Cambridge University Spoken Document Retrieval System,”Proc. IEEE ICASSP'99(1999).
Jones, et al. “Robust Talker-Independent Audio Document Retrieval,”Proc. IEEE ICASSP'96, pp. 311-314 (1996).
Jones, et al. “Video Mail Retrieval: The Effect of Word Spotting Accuracy on Precision,”Proc. IEEE ICASSP'95pp. 309-312 (1995).
Junkawitsch, et al. “A New Keyword Algorithm With Pre-Calculated Optimal Thresholds,” Proc.ICSLP'96, pp. 2067-2070 (1996).
Kimber, “Speaker Segmentation for Browsing Recorded Audio,”Proc. ACM CHI'95(1995).
Knill, et al. “Fast Implementaiton Methods for Viterbi-Based Word Spotting,”Proc. IEEE ICASSP'96(1996).
Knill, et al. “Speaker Dependent Keyword Spotting for Accessing Stored Speech,”Technical Report CUED/F-INFENG/TR 193(1994).
Kosonocky, et al. “A Continuous Density Neural Tree Network Word Spotting System,”Proc. IEEE ICASSP'95vol. I: pp. 305-308 (1995).
Kuhn “On Talker-Independent Word Recognition
Cardillo Peter S.
Clements Mark A.
Price William E.
Georgia Tech Research Corporation
Harper V. Paul
Occhiuti Rohlicek & Tsao LLP
LandOfFree
Phonetic searching does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Phonetic searching, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Phonetic searching will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3885789