摘要:
An object of the present invention is to provide a description method for efficiently representing contents of motion picture with a small data volume. The organization of the present invention (1) represents a trajectory of how each object has moved over time by using reference plane representing position information of each object, (2) sets a description unit based on a type of action of each object by using changes in shape of each object, (3) has actions of each object represented as each behavioral section, and (4) comprises a description facility capable of reading and interpreting definition of an object dependent on video contents, definition of classes of actions, and definition of interpretation of a scene by interaction of plural objects.
摘要:
The invention provides a method for classifying the motion of an object such as a human being in a moving picture. A template is prepared in advance, which includes the Gabor wavelet expansion coefficients of an object image in a plurality of frames of a video image sequence representing each of a plurality of different reference motions of an object in a moving picture. Then, processing is performed to obtain the Gabor wavelet expansion coefficients of an object image in a plurality of frames of a video image sequence representing an unknown motion of the object. Matching factors are calculated based on the expansion coefficients for the unknown motion and the expansion coefficients for the reference motions in the template, and finally the unknown motion is classified based on the matching factors.
摘要:
Edge information sensitive to a visual characteristic is efficiently stored, block artifacts are reduced, and highly efficient compression is accomplished by an image compressing method. The system encodes a digital image and includes: an image input for inputting the digital image; a segmenter for segmenting the digital image into a plurality of primitive regions and computing parameters about the luminance and chrominances of the primitive region for each the primitive region; a first merger for merging the plurality of primitive regions to generate first-order block candidates and classifying each of the first-order block candidates into any of a plurality of predetermined patterns; a first clusterer for clustering, among the first-order block candidates belonging to the same classification, the first-order block candidates, where the parameters about the luminance and chrominances of the primitive regions thereof can be approximated with linear transformation, as a first-order block, and representing a transformation coefficient of the linear transformation with a parameter; a second merger for merging a plurality of the first-order blocks to generate second-order block candidates and classifying the second-order block candidate in accordance with the pattern of each the first-order blocks of the second-order block candidate; a second clusterer for clustering, among the second-order block candidates belonging to the same classification, the second-order block candidates, where the transformation coefficients of the first-order blocks thereof can be approximated with linear transformation, as a second-order block, and representing a transformation coefficient of the linear transformation with a parameter; a controller for recursively executing the clustering of the block candidates while raising the order of the block in sequence until the clustering of the blocks becomes impossible; and an encoder for encoding the parameters of the coexisting multi-order blocks.
摘要:
An encoding method is provided with which users can select picture quality and a quantity of data in multiple stages and with which an image of higher picture quality can be regenerated by scalable selection, i.e., by further adding data to compressed data that can be decoded. Image data is compressed using: means (62) for segmenting an original image into a plurality of object regions where each region pixels all correlate with one another and for determining a hierarchical structure of the object regions; means (63) for approximating each of the object regions with at least one polygonal surface so that errors of a intensity of luminance and chrominances in each the pixel are less than a predetermined threshold value; means (64) for obtaining residual images by subtracting the approximated image from the original image or by subtracting a decompressed image of a compressed nth-order residual image from the original nth-order residual image; means (65) for compressing the nth-order residual image (n.gtoreq.1) by encoding; means (66) for storing the approximated image and the coded nth-order residual image; and means (67) for decompressing the compressed nth-order residual image from the nth-order residual image.
摘要:
A head detecting apparatus, including a foreground extraction section for extracting a foreground region in which a person is captured from an input image; a first main axis computing section which includes a first moment computing section for computing a moment around a center of gravity of the foreground region and calculating a main axis of the foreground region based on the moment around the center of gravity of the foreground region; a head computing section for computing a head region included in the foreground region as a part thereof based on the main axis of the foreground region and a shape of the foreground region; and an ellipse determining section for determining an ellipse to be applied to a person's head based on a shape of the head region.
摘要:
Provides evaluating programs that enable problems to be identified in a manner that reflects actual conditions for an evaluation of contents of interest. In an example embodiment, there is provided a program for evaluating contents of interest that causes a computer to implement the functions of: performing a primary evaluation of contents of interest on the basis of one evaluation criterion; performing a primary evaluation of the contents of interest on the basis of another evaluation criterion; and performing a secondary evaluation of the contents of interest on the basis of a plurality of the first evaluations.
摘要:
A method for extracting invisible information from an object in which visible information overlaps with at least part of invisible information formed in stealth ink. The method includes the steps of: irradiating the object with light for making the stealth ink luminous; receiving reflected light from the object; extracting image information from the received reflected light; splitting the image information into a plurality of pieces of color-channel information; obtaining a correlation function between the pixel values of at least two color-channel information selected from the plurality of pieces of color-channel information; reducing a visible information component of selected one of the plurality of pieces of color-channel information to thereby interpolate the color-channel information; and extracting invisible information from the interpolated one piece of color-channel information.
摘要:
The present invention provides a means to display the contents of a document using a selected display condition, while preserving the layout of the document. It provides an information processing system comprising: a web browser for displaying a document having a predetermined layout; and a display controller for controlling a method used by the web browser to display the document. The display controller includes: a layout structure analyzer for analyzing the structure of the layout for the document; a region arrangement determiner for dividing a web page under a desired display condition, whereby the contents of the page are displayed in order to display the document in accordance with regions that are allocated and that reflect the structure of the document layout obtained by the layout structure analyzer; and an intra-region contents determiner for determining which contents of the document are to be displayed inside each of the allocated regions that are determined by the region arrangement determiner.
摘要:
A thermal printer of the present invention is basically constructed to use roll form printing paper, but able to use also fan-fold form printing paper by mounting to a casing; a guide shaft for fan-fold form printing paper instead of a roll for roll form printing paper; and a guide unit for fan-fold form printing paper, provided with side plates for regulating both side edges of fan-fold printing paper and with a press plate for restricting the upper surface of the same, and mounted to the casing in condition of being connected to an opening between the casing and the rear end of a cover for the casing.The thermal printer of the present invention has a pair of reel plates to which the roll for roll form printing paper or the guide shaft for fan-fold form printing paper are mounted having larger diameter than the roll for roll form printing paper prior to use or guide shaft for fan-fold form printing paper, so that roll form or fan-fold form printing paper, when in use, is free from the skew, thereby enabling accurate and reliable paper feeding.
摘要:
Digest screen display content deciding means selects display elements belonging to respective regions of a document based on display priorities of the display elements, which are obtained by digest screen display priority information creating means, and decides selected display elements as display content of a digest screen under a condition where a total display area does not exceed a required display area. A merging relationship among the regions is set based on layout information for the regions, created by digest screen region layout information creating means. Display content deciding means decides the display content of a detail screen based on the merging relationship among the regions, and creates a digest of the detail screen based on control information created by control information creating means. Moreover, digest screen display content changing means changes the display content of the digest screen in response to an operation of a user.