Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-06-05
2009-02-03
Lerner, Martin (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S270000, C707S793000
Reexamination Certificate
active
07487086
ABSTRACT:
An approach to alignment of transcripts with recorded audio is tolerant of moderate transcript inaccuracies, untranscribed speech, and significant non-speech noise. In one aspect, a number of search terms are formed from the transcript such that each search term is associated with a location within the transcript. Possible locations of the search terms are then determined in the audio recording. The audio recording and the transcript are then aligned using the possible locations of the search terms. In another aspect a search expression is accepted, and then a search is performed for spoken occurrences of the search expression in an audio recording. This search includes searching for text occurrences of the search expression in a text transcript of the audio recording, and searching for spoken occurrences of the search expression in the audio recording.
REFERENCES:
patent: 4779209 (1988-10-01), Stapleford et al.
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5333275 (1994-07-01), Wheatley et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5701153 (1997-12-01), Reichek et al.
patent: 5729741 (1998-03-01), Liaguno et al.
patent: 5787414 (1998-07-01), Miike et al.
patent: 5822405 (1998-10-01), Astarabadi
patent: 5835667 (1998-11-01), Wactlar et al.
patent: 6023675 (2000-02-01), Bennett et al.
patent: 6076059 (2000-06-01), Glickman et al.
patent: 6260011 (2001-07-01), Heckerman et al.
patent: 6317710 (2001-11-01), Huang et al.
patent: 6345253 (2002-02-01), Viswanatham
patent: 6434520 (2002-08-01), Kanevsky et al.
patent: 6505153 (2003-01-01), Van Thong et al.
patent: 6507838 (2003-01-01), Syeda-Mahmood
patent: 6859803 (2005-02-01), Dagtas et al.
patent: 6901207 (2005-05-01), Watkins
patent: 7039585 (2006-05-01), Wilmot et al.
patent: 7089188 (2006-08-01), Logan et al.
patent: 7139756 (2006-11-01), Cooper et al.
patent: 7231351 (2007-06-01), Griggs
patent: 7263484 (2007-08-01), Cardillo et al.
patent: 2002/0143544 (2002-10-01), Gschwendtner et al.
patent: 2003/0004724 (2003-01-01), Kahn et al.
patent: 000877378 (1998-11-01), None
Clements et al. “Phonetic Searching of Digital Audio” Broadcast Engineering Conference, Las Vegas, Nevada, Apr. 2001, pp. 1-10.
Clements et al., “Phonetic Searching of Digital Audio,” Broadcast Engineering Conference, Las Vegas, Nevada, Apr. 2001, pp. 1-10.
Hauptmann et al., “Text, Speech, and Vision for Video Segmentation: The Informedia Project,” AAAI, Fall 1995, pp. 1-6.
Moreno et al., “A recursive algorithm for the forced alignment of very long audio segments,” ICSLP-1998, pp. 1-4.
Boreczky et al., “A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features,” Proc. ICASSP'98, 1998.
Brown et al., “Open Vocabulary Speech Indexing for Voice and Video Mail Retrieval,” Proc. of ACM Multimedia, pp. 307-316 (1996).
Choi et al., “Scan-Speech Content based Audio Navigator: A systems overview,” Proc ICSLP'98 (1998).
Cooper et al., “Building Searchable Collections of Enterprise Speech Data.”.
deVries “Radio and Television Information Filtering Through Speech Recognition.”.
Dharanipragada et al., “Audio-Indexing for Broadcasting News,” Proceedings of TREC6 (1997).
Dharanipragada et al., “Experimental Results in Audio Indexing,” Proceedings of TREC6 (1997).
Foote et al., “An Overview of Audio Information Retrieval, ” ACM-Springer Multimedia Systems (1998).
Gelin et al., “Keyword Spotting for Video Soundtrack Indexing,” Proc. IEEE ICASSP'96 vol. 1: pp. 299-302 (1996).
Hirschbert et al., “Finding Information in Audio: a New Paradigm for Audio Browsing and Retrieval,” In Proceeding of the ESCA ETRW Workshop (1999).
Kimber, “Speaker Segmentation for Browsing Recorded Audio,” Proc. ACM CHI'95 (1995).
Makhoul et al., “Speech and Language Technologies for Audio Indexing and Retrieval,” Proc. IEEE, 88(8), pp. 1338-1353 (2000).
Roy et al., “Audio Meeting History Tool: Interactive Graphical User-Support for Virtual Audio Meetings,” Proceedings of ESCA Workshop on Accessing Information in Spoken Audio, pp. 107-110 (1999).
Roy et al., “Speaker identification Based Text to Audio Alignment for an Audio Retrieval System,” Proc. of the Int. Conf. Acoustics, Speech and Signal Processing, vol. 2: pp. 1099-1103 (1997).
Lerner Martin
Nexidia Inc.
Occhiuti Rohlicek & Tsao LLP
LandOfFree
Transcript alignment does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Transcript alignment, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Transcript alignment will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4140438