摘要:
In order to assist a viewer in understanding the contents of a moving image, it is desired to plainly represent the contents of metadata to the viewer. The metadata relevant to the moving image has a stream data structure including one or more access units each being a data unit which can be independently processed. Each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, and balloon data to display relevant data of the object by a balloon. The balloon data includes text data to be displayed in the inside of the balloon and to express the contents of the object and position specifying data to specify a display position of the balloon.
摘要:
Since there is a case where reproduction of metadata is limited according to the processing power of a reproduction apparatus or the designation from a user, the invention provides a data structure in which metadata whose preferential reproduction is desired can be selected and reproduced. The metadata relevant to a moving image includes a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, second data including one of or both of data to specify a display method relevant to the spatio-temporal region and data to specify a processing to be performed when the spatio-temporal region is specified, and third data to specify, in a case where one or more access units exist on a same screen in the moving image at a time of reproduction of the metadata, reproduction priority of each of the access units.
摘要:
A resolution enhancing method of a video includes reducing a training video, extracting a high-frequency component from the training video, calculating a first feature vector including a feature amount of a first spatio-temporal box in the reduced video, storing pairs of the first feature vectors and second spatio-temporal boxes in the high-frequency component videos at the same positions as those of the first spatio-temporal boxes, expanding an input video, retrieving a first feature vector similar to a second feature vector including a feature amount of a third spatio-temporal box of an object of the input video to be processed, as an element, and adding a second spatio-temporal box making a pair with the retrieved first feature vector to a fourth spatio-temporal box in the expanded video at the same position as that of the third spatio-temporal box in order to generate an output video.
摘要:
Although it is necessary to display a mouse cursor in order to specify an object area in a moving image, there is a fear that a standard cursor spoils the atmosphere of the moving image or the intention of a producer. Then, a data structure is desired in which the shape of the cursor can be freely specified according to the contents of the moving image or the object. In metadata of the moving image, the object area appearing in the moving image is divided into plural data (access units), and the access unit is made to be capable of specifying the cursor shape, and the cursor shape in the object area is made freely changeable.
摘要:
There is provided a structure of metadata which can express an object of a complicated shape. There are added vcr_shape_num indicating the number of sub-planar areas constituting an object area, vcr_rule_code indicating that an integration system to integrate the sub-planar areas and to determine a final object area is written, vcr_rule_length to indicate the length of data of the integration system, and data vcr_rule to indicate the integration system, and thereafter, vcr_subregion_data of data of each sub-planar area appearing in vcr_rule is described continuously by the number indicated by vcr_shape_num.
摘要:
A resolution enhancing method of a video includes reducing a training video, extracting a high-frequency component from the training video, calculating a first feature vector including a feature amount of a first spatio-temporal box in the reduced video, storing pairs of the first feature vectors and second spatio-temporal boxes in the high-frequency component videos at the same positions as those of the first spatio-temporal boxes, expanding an input video, retrieving a first feature vector similar to a second feature vector including a feature amount of a third spatio-temporal box of an object of the input video to be processed, as an element, and adding a second spatio-temporal box making a pair with the retrieved first feature vector to a fourth spatio-temporal box in the expanded video at the same position as that of the third spatio-temporal box in order to generate an output video.
摘要:
In the case where metadata refers to relevant content on a network, since the relevant content on the network can not be necessarily permanently held, it is desirable that a time limit for making reference can be set. Besides, in order to provide relevant content which can be referred to only in a limited period for promoting sales, or in order to provide relevant content which can be referred to only in a limited period such as, for example, Christmas Day every year for raising the entertainingness of the metadata, it is desirable that a reference time limit of the relevant content can be set. Then, in order to set an effective time limit of an access unit, a flag to specify the effective time limit and the effective time limit are described in each access unit.
摘要:
A method and apparatus for reproducing metadata provides a data structure in which metadata whose preferential reproduction is desired can be selected and reproduced. The metadata relevant to a moving image includes a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, second data including one of or both of data to specify a display method relevant to the spatio-temporal region and data to specify a processing to be performed when the spatio-temporal region is specified, and third data to specify, in a case where one or more access units exist on a same screen in the moving image at a time of reproduction of the metadata, reproduction priority of each of the access units.
摘要:
There is provided a structure of metadata in which an operation at a time of object designation can be changed according to the kind of an input interface used for input or a method of an input operation. The metadata relevant to a moving image has a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, plural second data each specifying processing to be performed when the spatio-temporal region is designated, and third data to cause the respective second data to correspond to kinds of input interfaces or operation methods by which a user designates the object.
摘要:
There is provided an invention in which in a moving image hypermedia including moving image data and its metadata, when a viewer specifies an object, it is easy to confirm that the specification has been successfully made, and the specified object can be easily confirmed. Metadata relevant to a moving image has a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, second data to specify a processing to be performed when the spatio-temporal region is specified, third data including a display method for causing the spatio-temporal region to be recognized, and fourth data to specify, in a case where an event relevant to the spatio-temporal region is generated by a cursor operated by a user, calling the third data.