Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2000-02-03
2003-01-28
Dorvil, Richemond (Department: 2741)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S270000
Reexamination Certificate
active
06513003
ABSTRACT:
FIELD OF THE INVENTION
The invention relates to the field of communications, and more particularly to the delivery of audio and other media broadcasts combined with high-accuracy, synchronous textual streams reflecting the dialogue in that media.
BACKGROUND OF THE INVENTION
The robust growth in demand for both media content and delivery channels has increased the need for novel types of information, news, financial and other services. The Internet and other network technologies have enabled a variety of multipoint media streams, such as news Websites containing streamable video clips, audio clips and other media combinations. One frequent type of news source is a collective meeting or proceeding, in which one or a few speakers discuss information of interest to a wide audience. Those types of settings include sessions of Congress, presidential and other news conferences, corporate analysts' meetings, media conferences and other group events.
In the case of sessions of Congress and other governmental bodies, the timely delivery of the information content is particularly valuable. Many interested parties could benefit from prompt knowledge of pending provisions in legislation, rulings in court cases and other deliberations. For instance, individuals or organizations that would be affected by the enactment of pending legislation may want to furnish input to their representatives. Or constituents may want to take other actions to contribute or adjust to new statutory, regulatory or other programs.
The federal government deploys a host of communications facilities situated at a variety of sources, often issuing permits for access to those resources. For instance, the U.S. Congress permits press access to its chambers and hearing rooms, from which live video and audio feeds are generated for delivery to commercial networks, news and other organizations.
However, in the instance of legislative reporting, there is a particular demand for written records of the legislature's activities. Public and private organizations exist which take down and transcribe the activities of both chambers. Those Congressional transcripts are typically made available in hard copy or electronic format within about 48 hours from the time of the legislative sessions, for a subscription fee. This is in contrast to audio or visual feeds for network TV or other delivery, which are often contemporaneous with the debates and other activities. The media, the public, interest groups as well as the government bodies themselves would benefit from more timely and robust delivery of both live media and concurrent textual streams of the dialogue.
SUMMARY OF THE INVENTION
The invention relates to a system and method for the integrated delivery of media and synchronized transcription, in which a dedicated network collects, processes and delivers unified audio, video and textual content on a live basis to subscribers. In one regard, the invention may incorporate front-end audio or video servers which sense and collect the audible or video activities of a legislature, press conference, town meeting or other event.
The raw, digitized media feeds from the event are transmitted to a centralized distribution server, which in turn delivers the digitized stream of the event to a remote transcription facility, where automated and human transcription stages decode the dialogue taking place. After speech recognition and editing take place, the textual content is synchronized with the original audio, video or other media and delivered to subscribers, for instance via a Web site interface. Subscribers may configure the delivery modes according to their preference, for instance to silently parse the textual steam for key words, triggering full-screen, audible, wireless or other delivery of the audio or video content when a topic of interest is discussed.
The subscribers may alternatively choose to view and hear the media and textual output continuously, and may access archives for the purpose of reproducing text for research or editorial activities.
REFERENCES:
patent: 4041467 (1977-08-01), Cota et al.
patent: 4430726 (1984-02-01), Kasday
patent: 4866770 (1989-09-01), Seth-Smith et al.
patent: 4924387 (1990-05-01), Jeppesen
patent: 4965440 (1990-10-01), Hasegawa
patent: 5031113 (1991-07-01), Hollerbauer
patent: 5249050 (1993-09-01), Zato
patent: 5267155 (1993-11-01), Buchanan et al.
patent: 5280430 (1994-01-01), Woods et al.
patent: 5289523 (1994-02-01), Vasile et al.
patent: 5315386 (1994-05-01), Muramoto
patent: 5327176 (1994-07-01), Forler et al.
patent: 5345270 (1994-09-01), Saeger et al.
patent: 5347365 (1994-09-01), Harigai et al.
patent: 5347632 (1994-09-01), Filepp et al.
patent: 5369704 (1994-11-01), Bennett et al.
patent: 5428400 (1995-06-01), Landis et al.
patent: 5438370 (1995-08-01), Primiano et al.
patent: 5448474 (1995-09-01), Zamora
patent: 5477274 (1995-12-01), Akiyoshi et al.
patent: 5500920 (1996-03-01), Kupiec
patent: 5537151 (1996-07-01), Orr et al.
patent: 5539920 (1996-07-01), Menand et al.
patent: 5543850 (1996-08-01), Pratt et al.
patent: 5543851 (1996-08-01), Chang
patent: 5543852 (1996-08-01), Yuen et al.
patent: 5563804 (1996-10-01), Mortensen et al.
patent: 5572260 (1996-11-01), Onishi et al.
patent: 5594809 (1997-01-01), Kopec et al.
patent: 5615131 (1997-03-01), Mortensen et al.
patent: 5627594 (1997-05-01), van Gestel
patent: 5630060 (1997-05-01), Tang et al.
patent: 5648789 (1997-07-01), Beadles et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5689620 (1997-11-01), Kopec et al.
patent: 5703655 (1997-12-01), Corey et al.
patent: 5724481 (1998-03-01), Garberg et al.
patent: 5740245 (1998-04-01), Bennett et al.
patent: 5745184 (1998-04-01), Neal
patent: 5758080 (1998-05-01), Mortensen et al.
patent: 5768375 (1998-06-01), Yamaguchi et al.
patent: 5799276 (1998-08-01), Komissarchik et al.
patent: 5801782 (1998-09-01), Patterson
patent: 5815196 (1998-09-01), Alshawi
patent: 5822523 (1998-10-01), Rothschild et al.
patent: 5822528 (1998-10-01), Amano
patent: 5828836 (1998-10-01), Westwick et al.
patent: 5861883 (1999-01-01), Cuomo et al.
patent: 5870454 (1999-02-01), Dahlen
patent: 5883675 (1999-03-01), Herz et al.
patent: 5883896 (1999-03-01), Kopec et al.
patent: 5884256 (1999-03-01), Bennett et al.
patent: 5884277 (1999-03-01), Khilsa
patent: 5887243 (1999-03-01), Harvey et al.
patent: 5896129 (1999-04-01), Murphy et al.
patent: 5915092 (1999-06-01), Ludwig et al.
patent: 5949952 (1999-09-01), Bennett et al.
patent: 5959687 (1999-09-01), Dinwiddie et al.
patent: 5970141 (1999-10-01), Bennett et al.
patent: 5982448 (1999-11-01), Reyes
patent: 5983005 (1999-11-01), Monteiro et al.
patent: 5996000 (1999-11-01), Shuster
patent: 6005561 (1999-12-01), Hawkins et al.
patent: 6014706 (2000-01-01), Cannon et al.
patent: 6023675 (2000-02-01), Bennett et al.
patent: 6026395 (2000-02-01), Bennett et al.
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6345252 (2002-02-01), Beigi et al.
patent: WO 96/24840 (1996-09-01), None
patent: WO 98/34217 (1998-08-01), None
Proceedings of the Speech Recognition Workshop. Maison et al., “Audio visula speaker recognition for video broadcast news: some fusion techniques”. Pp. 161-167. 1999.*
ICASSP-97. 1997 IEEE International Conference on Acoustics, Speech and Signal Processing. ROy et al., “Speaker Identification Based Text to Audio Alignment for an Audio Retrieval System”. Pp. 1099-1102. Apr. 1997.*
Huangfu, J. et al., “Synchronized Captioning System Using MPEG-4 and SPHINX”, 18-899 Special Topics in Signal Processing Midsemester Project Report,Electrical and Computer Engineering, Mar. 1998, XP002172106, [accessed May 21, 2001], 2 pages.
Witbrock, M. et al., “Speech Recognition and Information Retrieval: Experiments in Retrieving Spoken Documents”, Proceedings of the Darpa Speech Recognition Workshop 1997, Feb. 1997, Virginia, XP002172107, [accessed May 21, 2001], 5 pages.
Yu, G. et al., “Identification of Speakers Engaged in Dialog”,IEEE, New York, Apr. 1993, XP000427806, ISBN: 0-7803-0946-4, Abstract, pp. II-383-386.
Angell Philip S.
Benson Matthew B.
Bivings, Jr. Frank Gary
Haque Mohammad A.
Levine Jason D.
Fair Disclosure Financial Network, Inc.
Testa Hurwitz & Thibeault LLP
LandOfFree
System and method for integrated delivery of media and... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for integrated delivery of media and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for integrated delivery of media and... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3026268