Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-06-12
2007-06-12
Harper, V. Paul (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S257000, C704S243000
Reexamination Certificate
active
10384273
ABSTRACT:
An approach to alignment of transcripts with recorded audio is tolerant of moderate transcript inaccuracies, untranscribed speech, and significant non-speech noise. In one aspect, a number of search terms are formed from the transcript such that each search term is associated with a location within the transcript. Possible locations of the search terms are then determined in the audio recording. The audio recording and the transcript are then aligned using the possible locations of the search terms. In another aspect a search expression is accepted, and then a search is performed for spoken occurrences of the search expression in an audio recording. This search includes searching for text occurrences of the search expression in a text transcript of the audio recording, and searching for spoken occurrences of the search expression in the audio recording.
REFERENCES:
patent: 5199077 (1993-03-01), Wilcox et al.
patent: 5333275 (1994-07-01), Wheatley et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5701153 (1997-12-01), Reichek et al.
patent: 5787414 (1998-07-01), Miike et al.
patent: 5822405 (1998-10-01), Shaun
patent: 6023675 (2000-02-01), Bennett et al.
patent: 6076059 (2000-06-01), Glickman et al.
patent: 6260011 (2001-07-01), Heckerman et al.
patent: 6317710 (2001-11-01), Huang et al.
patent: 6345253 (2002-02-01), Viswanathan
patent: 6505153 (2003-01-01), Van Thong et al.
patent: 2002/0143544 (2002-10-01), Gschwendtner
patent: 000877378 (1998-11-01), None
Clements et al., “Phonetic Searching of Digital Audio,” Broadcast Engineering Conference, Las Vegas, Nevada, Apr. 2001, pp. 1-10.
Hauptmann et al., “Text, Speech, and Vision for Video Segmentation: The Informedia Project,” AAAI, Fall 1995, pp. 1-6.
Moreno et al., “A recursive algorithm for the forced alignment of very long audio segments,” ICSLP-1998, pp. 1-4.
Boreczky et al. “A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features,”Proc. ICASSP'98(1998).
Brown, et al. “Open Vocabulary Speech Indexing For Voice and Video Mail Retreival,”Proc. of ACM Multimediapp. 307-316 (1996).
Choi et al. “SCAN—Speech Content based Audio Navigator: A systems overview,”Proc. ICSLP'98(1998).
Cooper, et al. “Building Searchable Collections of Enterprise Speech Data,”.
deVries “Radio and Television Information Filtering through Speech Recognition,”.
Dharanipragada, et al. “Audio-Indexing For Broadcasting News,”Proceedings of TREC6(1997).
Dharanipragada, et al. “Experimental Results in Audio Indexing,”Proceedings of TREC6(1997).
Foote, et al. “An Overview of Audio Information Retrieval,”ACM-Springer Multimedia Systems(1998).
Gelin, et al. “Keyword Spotting for Video Soundtrack Indexing,”Proc. IEEE ICASSP'96vol. 1: pp. 299-302 (1996).
Hirschberg, et al. “Finding Information In Audio: A New Paradigm For Audio Browsing and Retrieval,”In Proceeding of the ESCA ETRW Workshop(1999).
Kimber, “Speaker Segmentation for Browsing Recorded Audio,”Proc. ACM CHI'95(1995).
Makhoul et al., “Speech and Language Technologies for Audio Indexing and Retrieval,”Proc. IEEE, 88(8), pp. 13381353 (2000).
Roy, et al. “Audio Meeting History Tool: Interactive Graphical User-Support for Virtual Audio Meetings,”Proceedings of ESCA Workshop on Accessing Information in Spoken Audio, pp. 107-110 (1999).
Roy, et al. “Speaker Identification Based Text to Audio Alignment For An Audio Retrieval System,”Proc. of the Int. Conf. Acoustics, Speech and Signal Processingvol. 2: pp. 1099-1103 (1997).
Harper V. Paul
Nexidia Inc.
Occhiuti Rohlicek & Tsao LLP
LandOfFree
Transcript alignment does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Transcript alignment, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Transcript alignment will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3828004