Multimedia summary generation employing user feedback

Electrical computers and digital processing systems: multicomput – Computer-to-computer protocol implementing – Computer-to-computer data streaming

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C709S236000, C709S223000, C707S793000, C707S793000, C707S793000, C707S793000, C707S793000, C707S793000

Reexamination Certificate

active

06557042

ABSTRACT:

TECHNICAL FIELD
This invention relates to multimedia content summarization. More particularly, the invention relates to generating and providing summaries of multimedia content based on user feedback.
BACKGROUND OF THE INVENTION
Multimedia streaming—the continuous delivery of synchronized media data like video, audio, text, and animation—is a critical link in the digital multimedia revolution. Today, streamed media is primarily about video and audio, but a richer, broader digital media era is emerging with a profound and growing impact on the Internet and digital broadcasting.
Synchronized media means multiple media objects that share a common timeline. Video and audio are examples of synchronized media—each is a separate data stream with its own data structure, but the two data streams are played back in synchronization with each other. Virtually any media type can have a timeline. For example, an image object can change like an animated .gif file, text can change and move, and animation and digital effects can happen over time. This concept of synchronizing multiple media types is gaining greater meaning and currency with the emergence of more sophisticated media composition frameworks implied by MPEG-4, Dynamic HTML, and other media playback environments.
The term “streaming” is used to indicate that the data representing the various media types is provided over a network to a client computer on a real-time time, as-needed basis, rather than being pre-delivered in its entirety before playback. Thus, the client computer renders streaming data as it is received from a network server, rather than waiting for an entire “file” to be delivered.
In comparison to text-based or paper-based presentations, multimedia presentations can be very advantageous. Synchronized audio/visual presentations, for example, are able to capture and convey many subtle factors that are not perceivable from paper-based documents. Even when the content is a spoken presentation, an audio/visual recording captures gestures, facial expressions, and various speech nuances that cannot be discerned from text or even from still photographs.
Although streaming multimedia content compares favorably with textual content in most regards, one disadvantage is that it requires significant time for viewing. It cannot be “skimmed” like textual content. Thus, a “summarized” version of the multimedia content would be very helpful.
Various technologies are available for summarizing or “previewing” different types of media content. For example, technology is available for removing pauses from spoken audio content. Audio content can also be summarized with algorithms that detect “important” parts of the content as identified by pitch emphasis. Similarly, techniques are available for removing redundant or otherwise “unimportant” portions or frames of video content. Similar schemes can be used with other types of media streams, such as animation streams and script streams.
Although such techniques are available for previewing media content, these techniques lack a semantic understanding of the multimedia content. These techniques rely on assumptions regarding the multimedia content based on the way in which the presentation is made (e.g., the manner in which words and sentences are spoken, the manner in which video frames are sequenced, etc.), rather than an understanding of the importance of the different portions of the multimedia content.
Furthermore, current techniques lack the ability to distinguish between different groups of users. Different groups of people (e.g., the sales department and the legal department of a corporation) may feel that different portions of the multimedia content are interesting. Current techniques do not provide any ability to distinguish between such different user interests.
The invention described below addresses these disadvantages of summarizing multimedia content, providing an improved way to summarize such content.
SUMMARY OF THE INVENTION
A system includes a multimedia server computer or other device that can provide multimedia content, as well as summaries of the multimedia content, to one or more client computers. Summaries are generated to include those portions of the multimedia content that are most interesting to previous users, as identified by feedback from the previous users. Thus, the summary presented to a user includes the portions identified as interesting by previous users.
According to one aspect of the invention, each of the users of a client computer is identified as being part of a group and different summaries are generated for each group. Each summary includes those portions of the multimedia content that are most interesting to previous users of the corresponding group. Thus, the summary presented to a user includes only the portions identified as interesting by previous similar users (that is, users in the same group).
According to another aspect of the invention, the summaries are continually updated as each user is presented with the multimedia content and/or a summary of the content. Feedback from each user is collected and used to further refine the summary. For example, a portion of the multimedia content may be dropped from the summary if feedback from subsequent users indicate the portion is not interesting.
According to another aspect of the invention, the multimedia content is separated into multiple different segments or portions. The segments may be pre-determined or alternatively may be dynamically defined. Each of these segments is given a different “score” for each group. These scores are then modified as user feedback is received. User feedback indicating a segment is interesting increases the score of that segment, while user feedback indicating a segment is not interesting decreases the score of that segment. The highest scoring segments are then provided as the summary of the multimedia content.
According to another aspect of the invention, the user feedback includes both explicit and implicit feedback. A user's inputs during presentation of the multimedia content and/or the summary of the multimedia content are monitored. Explicit feedback may be provided by the user, such as selection of a “this is interesting” button or a “this is not interesting” button. Additionally, implicit feedback may be provided by the user, such as selection of a fast forward button (implying the portion is not interesting) or selection of a rewind or replay button (implying the portion is interesting).


REFERENCES:
patent: 5572651 (1996-11-01), Weber et al.
patent: 5592608 (1997-01-01), Weber et al.
patent: 5799292 (1998-08-01), Hekmatpour
patent: 5953506 (1999-09-01), Kalra et al.
patent: 6029195 (2000-02-01), Herz
patent: 6195654 (2001-02-01), Wachtel
patent: 6230205 (2001-05-01), Garrity et al.
patent: 6317710 (2001-11-01), Huang et al.
patent: 6366956 (2002-04-01), Krisnan
patent: 6385619 (2002-05-01), Eichstaedt et al.
patent: 6446130 (2002-09-01), Grapes
Informedia—Internet References, http://www.informedia.cs.cmu.edu, date unknown.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Multimedia summary generation employing user feedback does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Multimedia summary generation employing user feedback, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multimedia summary generation employing user feedback will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3051914

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.