Image analysis – Pattern recognition – Feature extraction
Reexamination Certificate
2002-10-24
2004-10-26
Boudreau, Leo (Department: 2621)
Image analysis
Pattern recognition
Feature extraction
C382S103000, C345S475000
active
06810148
ABSTRACT:
This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 11-187033, filed Jun. 30, 1999, the entire contents of which are incorporated herein by reference.
BACKGROUND OF THE INVENTION
The present invention relates to a method of describing object region data such that information about an object region in a video is described, an apparatus for generating object region data such that information about an object region in a video is generated, a video processing apparatus arranged to be given an instruction about an object in a video to perform a predetermined process or retrieve an object in a video, and a video processing method therefor.
Hypermedia associate related information, called hyperlinks, between media such as videos, sounds, and texts so that they can be referenced mutually. When video is the main medium, related information is provided for each object that appears in the video, and when an object is specified, its related information (text information or the like) is displayed. This structure is a representative example of hypermedia. An object in the video is expressed by a frame number or a time stamp of the video together with information for identifying a region in the video, and these are recorded either in the video data or as separate data.
Mask images have frequently been used as a means for identifying a region in a video. A mask image is a bitmap image in which pixels inside the identified region and pixels outside it are given different values. The simplest method gives a pixel value of “1” to the inside of the region and “0” to the outside. Alternatively, the α values used in computer graphics are sometimes employed. Since an α value can usually express 256 levels of gray, only some of the levels are used: the inside of the specified region is expressed as 255 and the outside as 0. Such an image is called an α map. When regions in an image are expressed by mask images, whether or not a pixel in a frame is included in the specified region can easily be determined by reading the value of the corresponding pixel of the mask image and checking whether it is 0 or 255. A mask image can express a region regardless of its shape, and can even express a discontinuous region. However, the mask image must have the same number of pixels as the original image, so there arises a problem in that the quantity of data cannot be reduced.
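The mask lookup described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the frame size, the rectangular region, and the helper names `make_alpha_map`, `is_inside`, and `mask_size_in_pixels` are hypothetical and do not come from the patent.

```python
# Hypothetical illustration: a tiny alpha map stored as a list of rows.
# 255 marks pixels inside the region, 0 marks pixels outside.
WIDTH, HEIGHT = 8, 6  # assumed frame size, chosen only for the example


def make_alpha_map(width, height, box):
    """Build an alpha map whose inside region is the box (x0, y0, x1, y1), inclusive."""
    x0, y0, x1, y1 = box
    return [
        [255 if (x0 <= x <= x1 and y0 <= y <= y1) else 0 for x in range(width)]
        for y in range(height)
    ]


def is_inside(alpha_map, x, y):
    """Return True when pixel (x, y) of the frame belongs to the specified region."""
    return alpha_map[y][x] == 255


def mask_size_in_pixels(alpha_map):
    """The mask holds one value per frame pixel, whatever the region's size."""
    return len(alpha_map) * len(alpha_map[0])


alpha_map = make_alpha_map(WIDTH, HEIGHT, box=(2, 1, 4, 3))
print(is_inside(alpha_map, 3, 2))      # pixel inside the box
print(is_inside(alpha_map, 0, 0))      # pixel outside the box
print(mask_size_in_pixels(alpha_map))  # always WIDTH * HEIGHT
```

The lookup is a single array read, which is why the test is easy, while the storage cost grows with the frame size rather than with the size or shape of the region.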
To reduce the quantity of data, the mask image is frequently compressed. When the mask image is a binary mask image consisting of 0 and 1, it can be processed as a binary image, so the compression methods employed in facsimile machines or the like are frequently used. In the MPEG-4 standard of ISO/IEC MPEG (Moving Picture Experts Group), an arbitrary shape coding method will be employed that compresses both the mask image consisting of 0 and 1 and the mask image using α values. This compression method uses motion compensation and can improve compression efficiency, but it requires complex compression and decoding processes.
To express a region in a video, the mask image or the compressed mask image has usually been employed. However, data for identifying a region is required to permit easy and quick extraction, to be reduced in quantity and to permit easy handling.
Hypermedia, on the other hand, usually assume an operation of displaying related information for a moving object in a video, and specifying such an object is somewhat more difficult than specifying an object in a still image. A user usually has difficulty in specifying a particular portion of the object. It can therefore be assumed that the user usually aims roughly at, for example, a portion near the center of the object. Moreover, because the object moves, a portion adjacent to the object but off the object itself is frequently specified. Data for specifying a region should therefore be adaptable to such media. In addition, a system that displays related information of a moving object in a video requires an aiding mechanism that makes it easier to specify the moving object.
As described above, the conventional method of expressing a desired object region in a video by using the mask image suffers from a problem in that the quantity of data cannot be reduced. The method arranged to compress the mask image raises a problem in that coding and decoding become too complicated. What is worse, a pixel of a predetermined frame cannot be accessed directly, which makes handling difficult.
Another problem is that no device has been provided that permits a user to easily specify a moving object in a video.
BRIEF SUMMARY OF THE INVENTION
Accordingly, it is an object of the present invention to provide a method of describing object region data and an apparatus for generating object region data which are capable of describing a desired object region in a video by using a small quantity of data and facilitating generation of data and handling of the same.
Another object of the present invention is to provide a method of describing object region data, an apparatus for generating object region data, a video processing method and a video processing apparatus with which a user is permitted to easily instruct an object in a video and determine the object.
Another object of the present invention is to provide a method of describing object region data, an apparatus for generating object region data, a video processing method and a video processing apparatus with which retrieval of an object in a video can easily be performed.
According to one aspect of the present invention, there is provided a method of describing object region data such that information about an arbitrary object region in a video is described over a plurality of continuous frames, the method identifying a desired object region in a video according to at least either of a figure approximated to the object region or a characteristic point of the object region; approximating a trajectory obtained by arranging positions of representative points of the approximate figure or the characteristic points of the object region in a direction in which frames proceed with a predetermined function; and describing information about the object region by using the parameter of the function.
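As a rough illustration of this aspect, the sketch below approximates the trajectory of one representative point over consecutive frames and keeps only the parameters of the fitted function. It rests on assumptions: the text says only "a predetermined function", so the first-order polynomial fitted by least squares is one possible choice, and the names `fit_line`, `describe_trajectory`, and `point_at` as well as the sample coordinates are hypothetical.

```python
def fit_line(ts, values):
    """Least-squares fit of values ~ a * t + b; returns (a, b)."""
    n = len(ts)
    mean_t = sum(ts) / n
    mean_v = sum(values) / n
    var_t = sum((t - mean_t) ** 2 for t in ts)
    cov = sum((t - mean_t) * (v - mean_v) for t, v in zip(ts, values))
    a = cov / var_t
    b = mean_v - a * mean_t
    return a, b


def describe_trajectory(frames, points):
    """Fit x(t) and y(t) separately; the result is all that needs to be stored."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return {"x": fit_line(frames, xs), "y": fit_line(frames, ys)}


def point_at(params, frame):
    """Recover the approximate position of the representative point at a frame."""
    ax, bx = params["x"]
    ay, by = params["y"]
    return (ax * frame + bx, ay * frame + by)


# Assumed sample data: the center of an approximate figure over five frames.
frames = [0, 1, 2, 3, 4]
centers = [(10.0, 20.0), (12.0, 21.0), (14.0, 22.0), (16.0, 23.0), (18.0, 24.0)]

params = describe_trajectory(frames, centers)
print(params)
print(point_at(params, 2))
```

Here five positions are reduced to four numbers, and the saving grows with the number of frames as long as the chosen function follows the motion closely enough.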
According to another aspect of the present invention, there is provided a method of describing object region data such that information about an arbitrary object region in a video is described over a plurality of continuous frames, the method describing the object region data by using information capable of identifying at least the frame number of a leading frame and the frame number of a trailing frame of the plurality of the subject frames or the time stamp of the leading frame and the time stamp of the trailing frame, information for identifying the type of the figure of an approximate figure approximating the object region, and the parameter of a function with which a trajectory obtained by arranging position data of representative points of the approximate figure corresponding to the object region in a direction in which frames proceed has been approximated.
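The items listed in this aspect could be held in a small record such as the one sketched below. This is an assumed layout, not the format defined by the patent: the class name `ObjectRegionData`, its field names, the use of linear functions, and the choice of two opposite corners as the representative points of a rectangle are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

# (slope, intercept) of value = slope * frame + intercept; a linear function is assumed.
Line = Tuple[float, float]


@dataclass
class ObjectRegionData:
    first_frame: int   # frame number of the leading frame
    last_frame: int    # frame number of the trailing frame
    figure_type: str   # type of the approximate figure, e.g. "rectangle"
    trajectories: List[Tuple[Line, Line]]  # one (x, y) function pair per representative point

    def covers(self, frame: int) -> bool:
        """True when the frame lies inside the described span of frames."""
        return self.first_frame <= frame <= self.last_frame

    def representative_points(self, frame: int) -> List[Tuple[float, float]]:
        """Evaluate every stored function at the given frame."""
        if not self.covers(frame):
            raise ValueError("frame is outside the described span")
        return [
            (ax * frame + bx, ay * frame + by)
            for (ax, bx), (ay, by) in self.trajectories
        ]


# Assumed example: a rectangle drifting to the right, described by two corners.
region = ObjectRegionData(
    first_frame=100,
    last_frame=160,
    figure_type="rectangle",
    trajectories=[((2.0, 10.0), (0.0, 20.0)), ((2.0, 50.0), (0.0, 60.0))],
)
print(region.covers(110))
print(region.representative_points(110))
```

Because any frame in the span can be evaluated directly from the parameters, no preceding frame has to be decoded first, in contrast to a compressed mask image.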
According to another aspect of the present invention, there is provided a method of describing object region data such that information about an arbitrary object region in a video is described over a plurality of continuous frames, the method describing the object region data by using information capable of identifying at least the frame number of a leading frame and the frame number of a trailing frame of the plurality of
Hori Osamu
Kaneko Toshimitsu
Mita Takeshi
Yamamoto Koji
Akhavannik Hussein
Kabushiki Kaisha Toshiba
Oblon & Spivak, McClelland, Maier & Neustadt P.C.