System and method for describing multimedia content

Image analysis – Pattern recognition – Feature extraction

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S181000, C382S190000, C382S236000, C382S305000, C382S311000, C345S418000, C345S473000, C345S215000, C345S960000, C358S403000, C707S793000, C707S793000, C707S793000, C707S793000

Reexamination Certificate

active

06490370

ABSTRACT:

TECHNICAL FIELD OF THE INVENTION
The present invention is directed, in general, to video processing systems and more specifically, to a system for identifying and describing the content of visual animated data.
BACKGROUND OF THE INVENTION
The advent of digital television (DTV), the increasing popularity of the Internet, and the introduction of consumer multimedia electronics, such as compact disc (CD) and digital video disc (DVD) players, have made tremendous amounts of multimedia information available to consumers. As video and animated graphics content becomes readily available and products for accessing it reach the consumer market, searching, indexing and identifying large volumes of multimedia data becomes even more challenging and important.
The term “visual animated data” herein refers to natural video, as well as to synthetic 2D or 3D worlds (e.g., VRML), or to a mixture of both video and graphics (e.g., MPEG-4). Different criteria are used to search and index the content of visual animated data, such as a video clip. Video processing systems have been developed for searching frames of visual animated data to detect, identify and label objects of a particular shape or color, or to detect text in the frames, such as subtitles, advertisement text, or background image text, such as a street sign or a “HOTEL” sign.
Presently under development is a new MPEG standard, MPEG-7, which is intended to establish a standard set of “descriptors” that can be used to describe different aspects of visual animated data. The descriptors, or combinations of descriptors and description schemes, directly describe the content of visual animated data, such as a video clip, thereby providing a fast and efficient way to search through an archive of video files and animated graphics files. MPEG-7 is intended to standardize some descriptors and description schemes in a comprehensive description definition language (DDL) to describe the content of visual animated data.
A descriptor, at its most basic, is a representation of an attribute of a feature (or object) in visual animated data. A feature can be something very basic, such as the color of a pixel in a specific frame in a movie, or a feature can be something more conceptual and broad, such as the name of the movie or the age of the character portrayed within the story of the movie. Collections of related descriptors are called description schemes. This language for creating these descriptors and description schemes is called a “description definition language” or DDL.
One goal of MPEG-7 is to allow content creators and content editors to describe any feature of visual animated data content in a manner that can be used by others and can be used for searching and retrieving the visual animated data content by the final consumers. Descriptors are coded so that they can be transmitted and stored efficiently. The MPEG-7 standard, however, is far from completion and many of its intended objectives may never be realized. Additionally, many of the MPEG-7 standard proposals include a full language for creating descriptors. The proposed languages allow a descriptor creator to specify the descriptor in a freeform manner using the syntax and semantics of the specific language. This is a “scriptbased” approach in which each descriptor is a script that can be used whenever a specific feature needs to be described. Under this approach, one descriptor may look nothing like any other descriptor in the DDL. Thus, the descriptors and description schemes that are created may be highly individualized with little commonality according to the choices of the descriptor creator.
There is therefore a need in the art for improved systems and methods for searching and indexing the content of visual animated data including video clips. More particularly, there is a need for a description definition language (DDL) that implements highly structured descriptors and description schemes that are readily recognizable and searchable by parser programs and other applications that detect and analyze descriptor information associated with visual animated data.
SUMMARY OF THE INVENTION
To address the above-discussed deficiencies of the prior art, it is a primary object of the present invention to provide a template containing a standard set of attributes that can be used to describe any feature. Each template comprises a descriptor. A user may describe a feature using a standard template and fill in values to create the descriptor. Using the description definition language to create descriptors, a content creator can describe the lower-level individual features of the multimedia content being created. The content creator can also describe the relationships between these lower level features and collect descriptors into logical groupings using description schemes.
All descriptors and description schemes created in accordance with the principles of the present invention are based on the standard template with some variations. Using a predefined template or set of templates, rather than script-based descriptors, makes the descriptors and description schemes of a visual animated data file easily recognizable and searchable.
Accordingly in one embodiment of the present invention, there is provided a video processing device capable of generating a descriptor data structure representative of a selected feature in a visual animated data file. The video processing device comprises: 1) user input means capable of selecting the selected feature and generating a plurality of attribute values associated with the selected feature; and 2) an image processor capable of identifying the selected feature in the visual animated data file and receiving the plurality of attribute values from the user input means and, in response to receipt of the plurality of attribute values, generating the descriptor data structure by inserting selected ones of the plurality of attribute values into corresponding ones of a plurality of pre-defined attribute fields in a standard descriptor template.
According to one embodiment of the present invention, the image processor is further capable of associating the descriptor data structure with the visual animated data file to thereby produce a modified visual animated data file, wherein the selected feature may be identified in the modified visual animated data file by examining the descriptor data structure.
According to another embodiment of the present invention, the selected feature is an object appearing in the visual animated data file and the descriptor data structure contains attribute values representative of the object.
According to still another embodiment of the present invention, the selected feature is an image frame in the visual animated data file and the descriptor data structure contains attribute values representative of the image frame.
According to yet another embodiment of the present invention, the selected feature is a sequence of image frames in the visual animated data file and the descriptor data structure contains attribute values representative of the sequence of image frames.
According to a further embodiment of the present invention, the descriptor template further comprises a plurality of user-defined attribute fields and wherein the image processor is capable of receiving a plurality of user-defined attribute values from the user input means and inserting selected ones of the plurality of user-defined attribute values in corresponding ones of the user-defined attribute fields.
According to a still further embodiment of the present invention, the plurality of pre-defined attribute fields in a standard descriptor template comprises a unique identification (ID) attribute field, wherein the plurality of pre-defined attribute fields are the same for descriptor data structures having the same ID attribute field.
The foregoing has outlined rather broadly the features and technical advantages of the present invention so that those skilled in the art may better understand the detailed description of the invention that follows. Additional features a

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for describing multimedia content does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for describing multimedia content, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for describing multimedia content will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2962926

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.