摘要:
An object region data describing method of describing information about the object region in an image over a plurality of frames determines the region of a target object in an image using an approximate function that approximates the trajectory obtained by arranging, in the direction of frame advance, one representative point of an approximate figure for the object region and the difference values for determining the other representative points and describes information about the object region using a parameter for the function.
摘要:
A data processing method processes image data including spatiotemporal locator information which is added. The spatiotemporal locator information represents a spatiotemporal area which is a temporal transition of an object region over a plurality of frames. The method sets a curved surface defined by a periphery of the spatiotemporal area based on the spatiotemporal locator information, calculates time when a reference trajectory crosses the curved surface, determines whether coordinate values of the reference trajectory are inside or outside of the object region in at least one of frames in which the object region exists, and outputs information relating to whether or not the reference trajectory passes the spatiotemporal area.
摘要:
An object region data describing method of describing information about the object region in an image over a plurality of frames determines the region of a target object in an image using an approximate function that approximates the trajectory obtained by arranging, in the direction of frame advance, one representative point of an approximate figure for the object region and the difference values for determining the other representative points and describes information about the object region using a parameter for the function.
摘要:
A moving-picture processing apparatus includes an acquisition unit configured to acquire metadata including information about each temporal region in an input moving picture with a plurality of temporal regions, a decision unit configured to determine a cutout region corresponding to at least any one of the plurality of temporal regions on the basis of the metadata, and a cutting-out unit configured to cut out the cutout region from an image in each frame of the input moving picture.
摘要:
A spatiotemporal locator processing method of correcting a spatiotemporal locator capable of specifying a trajectory of a representative point of an approximate figure representing an arbitrary region in order to represent a transition of the region over a plurality of frames in video data, obtains the trajectory of the representative point based on the spatiotemporal locator, displays the obtained trajectory of the representative point on a screen, receives input of a correction instruction for the trajectory displayed on the screen, and corrects the spatiotemporal locator based on the correction instruction.
摘要:
An object region data describing method of describing information about the object region in an image over a plurality of frames determines the region of a target object in an image using an approximate function that approximates the trajectory obtained by arranging, in the direction of frame advance, one representative point of an approximate figure for the object region and the difference values for determining the other representative points and describes information about the object region using a parameter for the function.
摘要:
In order to assist a viewer in understanding the contents of a moving image, it is desired to plainly represent the contents of metadata to the viewer. The metadata relevant to the moving image has a stream data structure including one or more access units each being a data unit which can be independently processed. Each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, and balloon data to display relevant data of the object by a balloon. The balloon data includes text data to be displayed in the inside of the balloon and to express the contents of the object and position specifying data to specify a display position of the balloon.
摘要:
Since there is a case where reproduction of metadata is limited according to the processing power of a reproduction apparatus or the designation from a user, the invention provides a data structure in which metadata whose preferential reproduction is desired can be selected and reproduced. The metadata relevant to a moving image includes a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, second data including one of or both of data to specify a display method relevant to the spatio-temporal region and data to specify a processing to be performed when the spatio-temporal region is specified, and third data to specify, in a case where one or more access units exist on a same screen in the moving image at a time of reproduction of the metadata, reproduction priority of each of the access units.
摘要:
Although it is necessary to display a mouse cursor in order to specify an object area in a moving image, there is a fear that a standard cursor spoils the atmosphere of the moving image or the intention of a producer. Then, a data structure is desired in which the shape of the cursor can be freely specified according to the contents of the moving image or the object. In metadata of the moving image, the object area appearing in the moving image is divided into plural data (access units), and the access unit is made to be capable of specifying the cursor shape, and the cursor shape in the object area is made freely changeable.
摘要:
There is provided a structure of metadata which can express an object of a complicated shape. There are added vcr_shape_num indicating the number of sub-planar areas constituting an object area, vcr_rule_code indicating that an integration system to integrate the sub-planar areas and to determine a final object area is written, vcr_rule_length to indicate the length of data of the integration system, and data vcr_rule to indicate the integration system, and thereafter, vcr_subregion_data of data of each sub-planar area appearing in vcr_rule is described continuously by the number indicated by vcr_shape_num.