摘要:
The invention provides a method for classifying the motion of an object such as a human being in a moving picture. A template is prepared in advance, which includes the Gabor wavelet expansion coefficients of an object image in a plurality of frames of a video image sequence representing each of a plurality of different reference motions of an object in a moving picture. Then, processing is performed to obtain the Gabor wavelet expansion coefficients of an object image in a plurality of frames of a video image sequence representing an unknown motion of the object. Matching factors are calculated based on the expansion coefficients for the unknown motion and the expansion coefficients for the reference motions in the template, and finally the unknown motion is classified based on the matching factors.
摘要:
An object of the present invention is to provide a description method for efficiently representing contents of motion picture with a small data volume. The organization of the present invention (1) represents a trajectory of how each object has moved over time by using reference plane representing position information of each object, (2) sets a description unit based on a type of action of each object by using changes in shape of each object, (3) has actions of each object represented as each behavioral section, and (4) comprises a description facility capable of reading and interpreting definition of an object dependent on video contents, definition of classes of actions, and definition of interpretation of a scene by interaction of plural objects.
摘要:
Edge information sensitive to a visual characteristic is efficiently stored, block artifacts are reduced, and highly efficient compression is accomplished by an image compressing method. The system encodes a digital image and includes: an image input for inputting the digital image; a segmenter for segmenting the digital image into a plurality of primitive regions and computing parameters about the luminance and chrominances of the primitive region for each the primitive region; a first merger for merging the plurality of primitive regions to generate first-order block candidates and classifying each of the first-order block candidates into any of a plurality of predetermined patterns; a first clusterer for clustering, among the first-order block candidates belonging to the same classification, the first-order block candidates, where the parameters about the luminance and chrominances of the primitive regions thereof can be approximated with linear transformation, as a first-order block, and representing a transformation coefficient of the linear transformation with a parameter; a second merger for merging a plurality of the first-order blocks to generate second-order block candidates and classifying the second-order block candidate in accordance with the pattern of each the first-order blocks of the second-order block candidate; a second clusterer for clustering, among the second-order block candidates belonging to the same classification, the second-order block candidates, where the transformation coefficients of the first-order blocks thereof can be approximated with linear transformation, as a second-order block, and representing a transformation coefficient of the linear transformation with a parameter; a controller for recursively executing the clustering of the block candidates while raising the order of the block in sequence until the clustering of the blocks becomes impossible; and an encoder for encoding the parameters of the coexisting multi-order blocks.
摘要:
An encoding method is provided with which users can select picture quality and a quantity of data in multiple stages and with which an image of higher picture quality can be regenerated by scalable selection, i.e., by further adding data to compressed data that can be decoded. Image data is compressed using: means (62) for segmenting an original image into a plurality of object regions where each region pixels all correlate with one another and for determining a hierarchical structure of the object regions; means (63) for approximating each of the object regions with at least one polygonal surface so that errors of a intensity of luminance and chrominances in each the pixel are less than a predetermined threshold value; means (64) for obtaining residual images by subtracting the approximated image from the original image or by subtracting a decompressed image of a compressed nth-order residual image from the original nth-order residual image; means (65) for compressing the nth-order residual image (n.gtoreq.1) by encoding; means (66) for storing the approximated image and the coded nth-order residual image; and means (67) for decompressing the compressed nth-order residual image from the nth-order residual image.
摘要:
The invention provide methods and apparatus for effectively identifying the occlusion of objects, such as persons, having a high degree of freedom. In an example embodiment, after initialization, an image is input, and an image region is extracted from image data. The distance is employed that is obtained when the shape of a two-dimensional histogram in the color space is transformed into the feature space. A graph is formed by using, the regions between the frames. A confidence factor is provided and image features are provided as weights to the edges that connect the nodes. Processing is performed, and the confidence factor is examined. A connection judged less possible to be a path is removed. When there is only one available connection for the occlusion point, this connection is selected.
摘要:
A method and an apparatus for using the trajectory of an object to access video contents, for example, to specify and display a specific video image scene. Such a video contents access method comprises the steps of: extracting objects from video contents; displaying movements of the objects as trajectories on a specific projection screen; specifying locations on the trajectories; and accessing a desired scene of the video contents. An apparatus is so designed that it performs the above method.
摘要:
A method and system for maneuvering a mobile robot are disclosed, which include a mobile robot having a plurality of sensors, including a sensor which detects the presence of an object and a sensor which detects image information related to the object. Utilizing data acquired by the plurality of sensors, elements of the object are determined. A hypothesis expressing the configuration of the object is then generated from the elements interpreted from the data. The hypothesis is generated by referencing a table in which interpretations of data acquired by the plurality of sensors are associated with possible configurations of objects in response to the generated hypothesis, the mobile robot is moved along a path which avoids the object.
摘要:
Methods and apparatus for effectively identifying the occlusion of objects, such as persons, having a high degree of freedom. In an example embodiment, after initialization, an image is input, and an image region is extracted from image data. The distance is employed that is obtained when the shape of a two-dimensional histogram in the color space is transformed into the feature space. A graph is formed by using the regions between the frames. A confidence factor is provided and image features are provided as weights to the edges that connect the nodes. Processing is performed and the confidence factor is examined. A connection judged less possible to be a path is removed. When there is only one available connection for the occlusion point, this connection is selected.
摘要:
The object of the invention is to extract from a moving picture and display at least a portion of team play, especially in a field sport, that is considered to be important for detailed analysis, so that the teamwork involved is easily discerned. A team play analysis apparatus comprises: a multi-resolution team play extraction unit for extracting a multi-resolution team play based on the trajectory of a player in the moving picture; a multi-resolution team play display unit for displaying at least a part of the multi-resolution team play through the interaction with a content creator; a parameter adjustment acceptance unit for accepting, from the content creator, the adjustment of a resolution in the multi-resolution team play display unit and the selection of groups, which constitute a series of desired plays to be picked up as an event, and collaborated movement of the groups; and a team play display unit for displaying a team play that is prepared through the adjustment and selection that are accepted by the parameter adjustment acceptance unit.
摘要:
An index generator that generates an index, which is data description contents, such as video contents, comprises: an index description device, for defining in advance basic index information concerning an index; a video display device for the input, the display or the output of contents to which an index is to be added; a triggering action input device, for accepting a triggering action in the contents that is displayed or output; and an index determination device, for generating index data based on the basic index information, which is defined by the index description device, and triggering action input history information, which is entered by the triggering action input device.