Generating hypermedia documents from transcriptions of...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C348S468000, C348S563000

active

06473778

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates generally to generating hypermedia documents, and more specifically to automatically creating hypermedia documents from conventional transcriptions of television programs.
BACKGROUND OF THE INVENTION
Today, several broadcasters publish transcriptions of their television programs on their web sites. Some manually augment the transcripts to include still images or audio clips (e.g., www.pbs.org, www.cnn.com). However, the amount of manual labor required to generate these hypermedia documents limits the number of programs that can be converted to web content. Methods useful in generating pictorial transcripts are disclosed in “Method for Providing a Compressed Rendition of a Video Program in a Format Suitable for Electronic Searching and Retrieval,” U.S. Pat. No. 6,098,082, filed Jul. 16, 1996, and “Method and Apparatus for Compressing a Sequence of Information-Bearing Frames Having at Least Two Media Components,” U.S. Pat. No. 6,271,892 B1, the disclosures of which are incorporated herein by reference in their entirety.
A method for converting closed captioned video programs into hypermedia documents automatically within minutes after the broadcast of the program is described in Shahraray, B., and Gibbon, D., “Automated Authoring of Hypermedia Documents of Video Programs”, Proc. Third Int. Conf. on Multimedia (ACM Multimedia '95), November 1995. However, the resulting quality of the pictorial transcript is a function of the level of skill of the closed caption operator, and there are many errors of omission, particularly during periods of rapid dialog. Further, since the caption is typically transmitted in upper case, an automatic case restoration process must be performed. This process is complex since it requires dynamically updated databases of proper nouns, as well as higher level processing to handle ambiguous cases. Conventional transcripts of television programs, however, are of higher quality since the time has been taken to assure that the dialog is accurately represented, and of course, case restoration is unnecessary.
SUMMARY OF THE INVENTION
The present invention is an apparatus, method and computer program product for producing an enriched time-referenced text stream using a time-referenced text stream and an enriched text stream. The method includes the steps of receiving the time-referenced text stream and the enriched text stream; aligning the text of the enriched text stream with the text of the time-referenced text stream; and transferring time references from the time-referenced text stream to the enriched text stream based on the alignment to produce an enriched time-referenced text stream. In one embodiment, the time-referenced text stream is a closed-captioned text stream associated with a media stream, and the enriched text stream is a transcript associated with the media stream.
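The align-and-transfer step can be illustrated with a minimal sketch. It assumes word-level caption timestamps and uses Python's standard `difflib.SequenceMatcher` as a stand-in aligner; the summary above does not say which alignment algorithm the invention uses, and the names `normalize`, `transfer_times` and the sample data are hypothetical.

```python
from difflib import SequenceMatcher


def normalize(word):
    """Lowercase a word and drop punctuation so both streams compare equal."""
    return "".join(ch for ch in word.lower() if ch.isalnum())


def transfer_times(captions, transcript_words):
    """Copy caption times onto matching transcript words.

    captions: list of (time_in_seconds, word) from the closed-caption stream.
    transcript_words: list of words from the higher-quality transcript.
    Returns a list of (time_or_None, word), one entry per transcript word.
    """
    caption_keys = [normalize(word) for _, word in captions]
    transcript_keys = [normalize(word) for word in transcript_words]
    # Assumption: a generic sequence matcher stands in for the real aligner.
    matcher = SequenceMatcher(a=caption_keys, b=transcript_keys, autojunk=False)
    times = [None] * len(transcript_words)
    for block in matcher.get_matching_blocks():
        for offset in range(block.size):
            times[block.b + offset] = captions[block.a + offset][0]
    return list(zip(times, transcript_words))


# Hypothetical sample: the upper-case caption omits "of France".
captions = [(0.0, "GOOD"), (0.5, "EVENING"), (1.2, "THE"),
            (1.4, "PRESIDENT"), (2.0, "SPOKE"), (2.6, "TODAY")]
transcript = "Good evening. The President of France spoke today.".split()
timed_transcript = transfer_times(captions, transcript)
```

Transcript words with no caption counterpart keep a time of `None` in this sketch; a fuller version might interpolate between neighboring time references instead.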
The method further includes the steps of receiving a multimedia stream; extracting the closed-captioned text stream from the multimedia stream; receiving a portion of a media stream of the multimedia stream; and linking a portion of the enriched time-referenced text stream with the portion of the media stream based on the time references to produce a hypermedia document.
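The linking step can be sketched in the same spirit. The example below assumes the media portions are still frames, each tagged with the time at which it was sampled, and attaches every timed word to the most recent frame. The function name `link_text_to_frames`, the frame file names and the sample data are all hypothetical.

```python
from bisect import bisect_right


def link_text_to_frames(timed_words, frames):
    """Group timed transcript words under the frame that precedes them.

    timed_words: list of (time_or_None, word) in reading order.
    frames: non-empty list of (time_in_seconds, image_name), sorted by time.
    Returns a list of (image_name, text), one entry per frame.
    """
    frame_times = [time for time, _ in frames]
    buckets = [[] for _ in frames]
    current = 0
    for time, word in timed_words:
        if time is not None:
            # Index of the last frame sampled at or before this word.
            current = max(bisect_right(frame_times, time) - 1, 0)
        # Words without a time reference stay with the previous word's frame.
        buckets[current].append(word)
    return [(name, " ".join(words)) for (_, name), words in zip(frames, buckets)]


# Hypothetical sample data.
timed_words = [(0.0, "Good"), (0.5, "evening."), (1.2, "The"),
               (1.4, "President"), (None, "of"), (None, "France"),
               (2.0, "spoke"), (2.6, "today.")]
frames = [(0.0, "frame0.jpg"), (1.9, "frame1.jpg")]
segments = link_text_to_frames(timed_words, frames)
```

Each resulting (image, text) pair is one candidate unit of the hypermedia document.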
In one embodiment, the method includes the steps of receiving a user request to generate a hypermedia document; and generating a hypermedia document in response to the user request using a selected template. The selected template can be specified by the user.
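Template-driven generation might look like the following sketch, which renders the same linked segments through whichever template the user selects. The template names (`pictorial`, `text_only`), their HTML and the function `render_document` are assumptions made for illustration, not the templates described in the patent.

```python
from html import escape
from string import Template

# Hypothetical template registry: each entry pairs a page wrapper
# with the markup used for one (image, text) segment.
TEMPLATES = {
    "pictorial": (
        Template("<html><body><table>\n$items\n</table></body></html>"),
        Template('<tr><td><img src="$image"></td><td>$text</td></tr>'),
    ),
    "text_only": (
        Template("<html><body>\n$items\n</body></html>"),
        Template("<p>$text</p>"),
    ),
}


def render_document(segments, template_name="pictorial"):
    """Render (image_name, text) segments with the user-selected template."""
    page, item = TEMPLATES[template_name]
    items = "\n".join(
        # Escape both fields so transcript text cannot break the markup.
        item.substitute(image=escape(image, quote=True), text=escape(text))
        for image, text in segments
    )
    return page.substitute(items=items)


# Hypothetical usage: the same content rendered two ways.
segments = [("frame0.jpg", "Good evening."), ("frame1.jpg", "Tom & Jerry spoke today.")]
pictorial_html = render_document(segments, "pictorial")
text_html = render_document(segments, "text_only")
```

Keeping the content separate from the templates means a new presentation only requires a new registry entry, not a change to the alignment or linking steps.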
Further features and advantages of the present invention, as well as the structure and operation of various embodiments of the present invention, are described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers indicate identical or functionally similar elements. Additionally, the left-most digit(s) of a reference number identifies the drawing in which the reference number first appears.


REFERENCES:
patent: 5481296 (1996-01-01), Cragun et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5664227 (1997-09-01), Mauldin et al.
patent: 5737725 (1998-04-01), Case
patent: 6025837 (2000-02-01), Matthews et al.
patent: 6076059 (2000-06-01), Glickman et al.
patent: 6098082 (2000-08-01), Gibbon et al.
patent: 6243676 (2001-06-01), Witteman
patent: 6263507 (2001-07-01), Ahmad et al.
patent: 6271892 (2001-08-01), Gibbon et al.
patent: 2001/0018693 (2001-08-01), Jain et al.
Lemay, Laura. Teach Yourself Web Publishing with HTML 4. Second Edition. 1997. Sams.net Publishing. pp. 731-732, 927-929.*
Intelligent Multimedia Information Retrieval, Chapter 11, Informedia: News-on-Demand Multimedia Information Acquisition and Retrieval, Alexander G. Hauptmann and Michael J. Witbrock (Mark T. Maybury ed., AAAI Press 1997).
Michael J. Witbrock and Alexander G. Hauptmann, Improving Acoustic Models by Watching Television, Carnegie Mellon University CMU-CS-98-110, 1998.
William A. Gale and Kenneth W. Church, A Program for Aligning Sentences in Bilingual Corpora, Computational Linguistics, 1993.
Church, K., Char-align: A Program for Aligning Parallel Texts at the Character Level, Association for Computational Linguistics, pp. 9-16, 1993.
Joan Bachenko, Jeffrey Daugherty, and Eileen Fitzpatrick, A Parser for Real-Time Speech Synthesis of Conversational Texts, Proceedings of the ACL Conference on Applied Natural Language Processing, Apr. 1992.
Behzad Shahraray, Scene Change Detection and Content-Based Sampling of Video Sequences, Digital Video Compression: Algorithms and Technologies 1995, Proceedings of the SPIE 2419, Feb. 1995.
Patrick A. V. Hall and Geoff R. Dowling, Approximate String Matching, ACM Computing Surveys, vol. 12, No. 4, 1980.
Daniel S. Hirschberg, Algorithms for the Longest Common Subsequence Problem, Journal of the ACM, 24(4):664-675, Oct. 1977.
Robert A. Wagner and Michael J. Fischer, The String-to-String Correction Problem, Journal of the ACM, 21(1):168-173, Jan. 1974.
Artificial Intelligence Frontiers in Statistics: AI and Statistics, Chapter 21, A Statistical Approach to Aligning Sentences in Bilingual Corpora (D.J. Hand ed., Chapman & Hall 1993).
B. Shahraray et al., “Automated Authoring of Hypermedia Documents of Video Programs”, Proc. Third Int. Conf. on Multimedia (ACM Multimedia '95), San Francisco, CA, Nov. 1995.
