Abstract:
In order to assist a viewer in understanding the contents of a moving image, it is desired to plainly represent the contents of metadata to the viewer. The metadata relevant to the moving image has a stream data structure including one or more access units each being a data unit which can be independently processed. Each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, and balloon data to display relevant data of the object by a balloon. The balloon data includes text data to be displayed in the inside of the balloon and to express the contents of the object and position specifying data to specify a display position of the balloon.
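As a rough illustration of the access-unit layout described above, the following Python sketch models one access unit and looks up the balloons that are effective at a given time. All class and field names (`AccessUnit`, `Balloon`, `start`, `end`, `region`) are hypothetical; the abstract does not specify a concrete encoding, and the spatio-temporal region is simplified here to one bounding box per timestamp.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Assumed representation of an object area at one instant: (x, y, width, height).
Box = Tuple[int, int, int, int]


@dataclass
class Balloon:
    text: str                  # text displayed inside the balloon
    position: Tuple[int, int]  # display position of the balloon (assumed: x, y in pixels)


@dataclass
class AccessUnit:
    start: float               # start of the effective period, in seconds
    end: float                 # end of the effective period, in seconds
    region: Dict[float, Box]   # simplified object area data: timestamp -> bounding box
    balloon: Balloon           # balloon data for the object

    def is_active(self, t: float) -> bool:
        """Return True if time t lies inside the effective period."""
        return self.start <= t <= self.end


def balloons_at(units: List[AccessUnit], t: float) -> List[Balloon]:
    """Collect the balloons of all access units that are effective at time t."""
    return [u.balloon for u in units if u.is_active(t)]
```

Because each access unit carries its own effective period, a player can process the units independently and only needs to test the current playback time against each one.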
Abstract:
A video object clipping method includes storing, in a storage unit, original images each including a video object to be clipped and reference alpha images representing objects prepared, determining a criteria original image and a criteria reference alpha image from the original images and the reference alpha images, determining a deformation parameter by deforming the criteria reference alpha image to correspond to the criteria original image, and deforming remaining ones of the reference alpha images according to the determined deformation parameter to generate output alpha images corresponding to the original images.
Abstract:
Since there is a case where reproduction of metadata is limited according to the processing power of a reproduction apparatus or the designation from a user, the invention provides a data structure in which metadata whose preferential reproduction is desired can be selected and reproduced. The metadata relevant to a moving image includes a stream data structure including one or more access units each being a data unit which can be independently processed, and each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, second data including one of or both of data to specify a display method relevant to the spatio-temporal region and data to specify a processing to be performed when the spatio-temporal region is specified, and third data to specify, in a case where one or more access units exist on a same screen in the moving image at a time of reproduction of the metadata, reproduction priority of each of the access units.
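The priority-based selection described above might be sketched as follows. This is only an illustration under stated assumptions: the names `PrioritizedUnit` and `select_for_reproduction` are hypothetical, a smaller `priority` value is assumed to mean higher priority, and the limit `max_units` stands in for whatever constraint the reproduction apparatus or the user imposes.

```python
from typing import List, NamedTuple


class PrioritizedUnit(NamedTuple):
    name: str      # label for the access unit (illustrative only)
    start: float   # start of the effective period, in seconds
    end: float     # end of the effective period, in seconds
    priority: int  # assumed convention: smaller value = higher priority


def select_for_reproduction(
    units: List[PrioritizedUnit], t: float, max_units: int
) -> List[PrioritizedUnit]:
    """Return at most max_units access units effective at time t, best priority first."""
    active = [u for u in units if u.start <= t <= u.end]
    # sorted() is stable, so units with equal priority keep their stream order.
    return sorted(active, key=lambda u: u.priority)[:max_units]
```

A device with limited processing power could call this with a small `max_units`, so that lower-priority metadata is dropped first when several access units share the same screen.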
Abstract:
An image processing device includes an estimator, a first calculator, a selector, a determination module, and a third calculator. The estimator estimates motion vectors from a target pixel of a process target image to reference images of an input video. The first calculator calculates candidate pixel values corresponding to positions in the reference images. The selector selects motion vectors with small errors, the number of selected motion vectors being smaller than the number of motion vectors acquired for the target pixel. The determination module determines the candidate pixel values corresponding to the selected motion vectors. The third calculator calculates a corrected pixel value of the target pixel from an arithmetic average or a weighted sum of the pixel value of the target pixel and the candidate pixel values determined to be equal to or less than the reference.
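The selection and averaging steps can be illustrated with a small Python sketch. It makes several assumptions that the abstract does not confirm: motion estimation has already produced one `(error, candidate_value)` pair per motion vector, the determination step is interpreted as keeping candidates whose absolute difference from the target pixel value does not exceed `reference`, and the plain arithmetic average is used instead of a weighted sum. The function name `correct_pixel` and its parameters are hypothetical.

```python
from typing import List, Tuple


def correct_pixel(
    target_value: float,
    candidates: List[Tuple[float, float]],  # (matching error, candidate pixel value)
    keep: int,
    reference: float,
) -> float:
    """Return a corrected value for one target pixel."""
    if not 0 < keep < len(candidates):
        raise ValueError("keep must be positive and smaller than the number of candidates")

    # Selector: keep only the `keep` motion vectors with the smallest error.
    selected = sorted(candidates, key=lambda c: c[0])[:keep]

    # Determination (assumed interpretation): accept a candidate only if it
    # differs from the target pixel value by at most `reference`.
    accepted = [value for _, value in selected if abs(value - target_value) <= reference]

    # Arithmetic average of the target pixel value and the accepted candidates.
    return (target_value + sum(accepted)) / (1 + len(accepted))
```

If no candidate is accepted, the function returns the target pixel value unchanged, which avoids blending in values from mismatched regions.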
Abstract:
An image processing apparatus, for each pixel to be determined, set in sequence in an image, sets the pixel values of a template block that includes the pixel to be determined, arranges a plurality of reference blocks so as to surround the template block, obtains block matching errors between the pixel values of each of the reference blocks and the pixel values of the template block, and determines that the pixel to be determined is in an edge area when the smallest of the block matching errors is an outlier among all the block matching errors.
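One way to picture this edge test is the pure-Python sketch below. It is an illustration under assumptions, not the claimed method: the block matching error is taken to be the sum of absolute differences, the eight reference blocks are placed one block size away in each direction, and the smallest error is treated as deviating when it lies more than `k` standard deviations below the mean. The names `block_sad`, `is_edge_pixel`, `size`, and `k` are hypothetical.

```python
from statistics import mean, pstdev
from typing import List

Image = List[List[int]]  # grayscale image as rows of pixel values


def block_sad(img: Image, y0: int, x0: int, y1: int, x1: int, size: int) -> int:
    """Sum of absolute differences between two size x size blocks (top-left corners given)."""
    return sum(
        abs(img[y0 + dy][x0 + dx] - img[y1 + dy][x1 + dx])
        for dy in range(size)
        for dx in range(size)
    )


def is_edge_pixel(img: Image, y: int, x: int, size: int = 3, k: float = 1.0) -> bool:
    """Decide whether pixel (y, x) lies in an edge area.

    The caller must keep (y, x) far enough from the border that all eight
    surrounding reference blocks fit inside the image.
    """
    half = size // 2
    ty, tx = y - half, x - half  # top-left corner of the template block
    errors = [
        block_sad(img, ty, tx, ty + oy, tx + ox, size)
        for oy in (-size, 0, size)
        for ox in (-size, 0, size)
        if (oy, ox) != (0, 0)
    ]
    sigma = pstdev(errors)
    if sigma == 0:
        return False  # all errors are equal, so none of them deviates
    # Assumed deviation test: the smallest error is far below the mean error.
    return min(errors) < mean(errors) - k * sigma
```

Along an edge, the reference blocks that lie in the edge direction match the template much better than the others, so the smallest error stands apart from the rest; in a flat or uniformly textured area the errors are similar and the test fails.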
Abstract:
A super-resolution device and method for setting at least one of a plurality of pixels included in image data as target pixels, the image data including pixels arranged in a screen and pixel values representing brightness, an area including the target pixel and peripheral pixels as a target area, and an area for searching pixel value change patterns in the target pixel area; calculating a difference between a first change pattern and second change pattern; comparing a difference between the first and second change patterns; calculating a pixel value of a super-resolution image having a number of pixels larger than a number of pixels included in the image data on the basis of a decimal-accuracy-vector, an extrapolated vector, and pixel values obtained from the image data.
Abstract:
A scroll position estimation apparatus includes a unit acquiring a complex document including main data and sub-data items, the main data being for determining a layout of presentation of the complex document, a unit visualizing the main data, a unit presenting the visualized main data, a unit acquiring scroll positions indicating display areas of the presented main data, a unit storing pairs of the scroll positions and times of acquiring the scroll positions, a unit estimating a future scroll position based on the scroll positions and the times, a unit computing priority levels of the sub-data items based on the estimated scroll position, the priority levels being used to determine order of visualization, a unit sequentially visualizing the sub-data items in accordance with the computed priority levels of the sub-data items, and a unit presenting the visualized sub-data items.
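The estimation and prioritization steps could look roughly like the sketch below. The abstract does not say how the future scroll position is estimated or how priority is computed, so this example assumes linear extrapolation from the two most recent samples and ranks sub-data items by their distance from the estimated position. The names `estimate_scroll` and `visualization_order` are hypothetical.

```python
from typing import Dict, List, Tuple


def estimate_scroll(history: List[Tuple[float, float]], future_time: float) -> float:
    """Estimate the scroll position at future_time from (time, position) pairs.

    Assumption: constant scroll velocity, taken from the last two samples.
    """
    if not history:
        raise ValueError("history must contain at least one sample")
    if len(history) < 2:
        return history[-1][1]
    (t0, p0), (t1, p1) = history[-2], history[-1]
    if t1 == t0:
        return p1
    velocity = (p1 - p0) / (t1 - t0)
    return p1 + velocity * (future_time - t1)


def visualization_order(sub_items: Dict[str, float], estimated_position: float) -> List[str]:
    """Order sub-data items so that those nearest the estimated position come first.

    sub_items maps an item name to its vertical offset in the laid-out document.
    """
    return sorted(sub_items, key=lambda name: abs(sub_items[name] - estimated_position))
```

For example, with samples `(0.0, 0)` and `(1.0, 200)`, the position estimated for time `2.0` is `400.0`, so an item laid out near offset 450 would be visualized before one near offset 100.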
Abstract:
An apparatus includes a unit acquiring, from an image source, an image represented by pixel values indicating brightness levels, a unit setting one frame as a reference frame, a unit sequentially setting each of the pixels in one or more frames as a target pixel, a unit setting target-image regions including the target pixels, a unit searching the reference frame for similar-target-image regions similar to each of the target-image regions in a change pattern of pixel values, a unit selecting, from the similar-target-image regions, corresponding points corresponding to each of the target pixels, a unit setting sample values concerning brightness at the corresponding points to a pixel value of a target pixel corresponding to the corresponding points, and a unit computing pixel values in a high-resolution image corresponding to the reference frame, based on the sample values and the corresponding points, the high-resolution image containing a larger number of pixels than the reference frame.
Abstract:
An image analysis method analyzes the motion of an object by using time-series image data. The method comprises inputting time-series image data, generating time-series object image data by extracting an object region from the time-series image data, and analyzing the motion of the object from the time-series object image data.