Abstract:
The present invention provides a method and system for dynamically updating descriptions of AV content, such as MPEG-7 content descriptions. In one embodiment, a server first issues a command indicating the type of update to make to the description on a client computer. Specifically, the update may add, delete, or change a node in a tree or graph structure describing the AV content. For example, the command may specify an update to an instance document or its associated schema. The server also specifies the location in the description at which to perform the update, given as either an absolute or a relative address. Next, the server sends the data, if any, to be added or used for the change. Embodiments also provide steps to check that the update is authorized.
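The update flow described in this abstract (a command type, an absolute or relative address, optional data, and an authorization check) can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Node` class, the `resolve` and `apply_update` helpers, the slash-separated address syntax, and the boolean `authorized` flag are all hypothetical and are not taken from the patent or from MPEG-7.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One node of a tree-structured description (hypothetical model)."""
    name: str
    value: Optional[str] = None
    children: List["Node"] = field(default_factory=list)


def resolve(root, address, context=None):
    """Find a node by address.

    Assumption: addresses starting with "/" are absolute (walked from the
    root); anything else is relative to the given context node.
    """
    start = root if address.startswith("/") else context
    if start is None:
        raise ValueError("a relative address needs a context node")
    node = start
    for part in filter(None, address.split("/")):
        match = next((c for c in node.children if c.name == part), None)
        if match is None:
            raise KeyError(f"no node named {part!r}")
        node = match
    return node


def apply_update(root, command, address, data=None, context=None, authorized=True):
    """Apply one add/delete/change command on the client-side tree."""
    if not authorized:
        raise PermissionError("update not authorized")
    if command == "add":
        # data is the new child node to attach at the addressed node
        resolve(root, address, context).children.append(data)
    elif command == "change":
        # data is the new value for the addressed node
        resolve(root, address, context).value = data
    elif command == "delete":
        # split the address into parent part and the name of the node to drop
        head, _, name = address.rpartition("/")
        parent_address = "/" if address.startswith("/") and not head else head
        parent = resolve(root, parent_address, context)
        parent.children.remove(resolve(root, name, context=parent))
    else:
        raise ValueError(f"unknown command {command!r}")
```

For instance, `apply_update(root, "change", "/Segment/Title", "New")` would use an absolute address, while `apply_update(root, "delete", "Title", context=segment_node)` would use an address relative to a node the client already holds.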
Abstract:
The color tone compensation method provides a simple and efficient way to compensate for color tone differences between two different sources of images. A first image sample, such as a still image, from a first image capturing source and a second image sample, such as a video frame, from a second image capturing source are aligned, and a tone-mapping estimation routine is applied to the two aligned images. The tone-mapping estimation routine uses the pixel intensity value histograms associated with the two aligned images and generates a tone mapping table. The tone mapping table includes a conversion intensity value for each intensity value in the second image. The conversion intensity value is a statistical measure, such as the mean, calculated from the data in the corresponding pixel intensity value histogram. The tone mapping table is applied to any image generated by the second image capturing source, thereby generating a new image with a color tone similar to that of the first image generated by the first image capturing source.
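One possible reading of the tone-mapping estimation step is sketched below. It assumes the two images are already aligned, single-channel, and stored as nested lists of integer intensities, and it uses the mean as the statistical measure. The function names `build_tone_map` and `apply_tone_map`, and the choice to leave unobserved intensities unchanged, are assumptions made for illustration only.

```python
from collections import defaultdict


def build_tone_map(reference, source, levels=256):
    """Build a tone mapping table from two aligned single-channel images.

    For every intensity v seen in `source`, a histogram is kept of the
    `reference` intensities found at the same pixel positions; the table
    entry for v is the mean of that histogram.
    """
    histograms = defaultdict(lambda: defaultdict(int))
    for ref_row, src_row in zip(reference, source):
        for r, v in zip(ref_row, src_row):
            histograms[v][r] += 1

    # Assumption: intensities never observed in `source` map to themselves.
    table = list(range(levels))
    for v, hist in histograms.items():
        count = sum(hist.values())
        table[v] = round(sum(r * n for r, n in hist.items()) / count)
    return table


def apply_tone_map(image, table):
    """Convert any image from the second source using the table."""
    return [[table[v] for v in row] for row in image]
```

The table is estimated once from one aligned pair and can then be reused for every later frame from the second source, which is what keeps the approach cheap.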
Abstract:
Motion Compensated Prediction (MCP) has been a key factor in most advanced video compression schemes. To further reduce the residual signal energy in B-frames, bidirectional prediction, in which two motion-compensated signals are superimposed, has also been used in most prior video coding standards such as MPEG-2 or MPEG-4/AVC. Syntax changes and appropriate motion vector prediction that allow efficient use of multi-parameter MCP are described. The prediction signal is constructed by linearly combining the motion-compensated signals from each parameter (or motion vector).
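The linear combination of motion-compensated signals can be illustrated with a small sketch. It assumes integer-pel motion vectors, one weight per motion vector, and reference frames stored as nested lists; the helper names and the absence of boundary handling are simplifications for illustration, not details from the abstract.

```python
def motion_compensate(ref, mv, y, x, size):
    """Fetch the size x size block at (y, x) displaced by mv = (dy, dx).

    Assumption: integer-pel vectors that stay inside the reference frame.
    """
    dy, dx = mv
    return [[ref[y + dy + i][x + dx + j] for j in range(size)]
            for i in range(size)]


def predict_block(refs, mvs, weights, y, x, size):
    """Linearly combine one motion-compensated block per motion vector."""
    blocks = [motion_compensate(ref, mv, y, x, size)
              for ref, mv in zip(refs, mvs)]
    return [[sum(w * b[i][j] for w, b in zip(weights, blocks))
             for j in range(size)]
            for i in range(size)]
```

With two motion vectors and weights of 0.5 each, this reduces to the familiar bidirectional average; more vectors or unequal weights give the multi-parameter case.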
Abstract:
An apparatus or method for efficiently coding high-resolution video using texture analysis and synthesis techniques in a scalable video coding framework. A high-resolution video signal is spatially downsampled and encoded into a base-layer. Texture and structure information are extracted from the downsampled signal and base-layer for use by a texture synthesizer. The texture synthesizer is structurally and texturally aware, utilizing edge information from a base-layer of the synthesizer to improve synthesis. After synthesis, a video quality assessor directs enhancement-layer coding of unacceptably synthesized areas by alternate (non-textural) coding means, such as conventional AVC or MPEG-2 coding. In one mode, the quality assessor iteratively improves synthesis of certain blocks to make them acceptable for enhancement-layer coding. The apparatus or method outputs a bit stream, or bit streams, containing both the coded base-layer and the enhancement-layer.
Abstract:
An image processing apparatus and an image processing method, applied to e.g. a TV conference system or a TV telephone system, in which an image to be transmitted or received is captured and reconstructed into a virtual viewpoint image that appears as if it had been captured by a virtual camera. Correspondence between pixels can be established with higher robustness and accuracy, while mismatches in that correspondence, otherwise caused by disparity, are reduced as far as possible. To this end, the disparity between the images obtained by imaging an object from different viewpoints with at least two cameras 11a, 12a is detected and, based on the detected disparity, each image is separated into plural image regions. For each separated image region, the correspondence relation is found in association with the object, from one pixel position to the next on the same horizontal line. From the pixel positions and associated luminance components for which the correspondence relation has been found, a virtual viewpoint image, as would be generated by an imaginary virtual camera, is constructed.
Abstract:
The currently existing ISO/AVC file format is modified by providing extensions to store and access video content currently being defined by the SVC standard. Specifically, extensions to the AVC file format are made to provide a new SVC file format that enables the storage and access of scalable video data. The scalable video data is stored as a single track within a media data section of the SVC file format. New extensions are defined for description entries and boxes within a metadata section of the SVC file format. These extensions provide means for extracting sub-streams or layers from the single track of scalable video data stored in the media data section.
Abstract:
A method permitting compact ways to update relationships between entities in an audiovideo sequence, or serial set of sequences, the method comprising: writing a description between the entities, the description containing relations; determining the relations that may be represented by parameters, each parameter having a numerical value; and obtaining from the user one or more of the following: (a) the numerical value for the parameter; (b) a description of the parameter containing the numerical value; and (c) a description capable of setting the parameter dynamically.
Abstract:
A method of coarse representation of a visual object's shape for search/query/filtering applications uses a binding box that fully encompasses the object of interest within the image to extract a feature vector. Once the feature vector is available, matching based on specific queries may be performed using a search engine that compares the query number to an appropriate element of the feature vector and sorts the results to pick the best matches.
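A rough sketch of this extract-then-match flow is given below. The abstract does not say which elements the feature vector contains, so the two used here (aspect ratio and fill ratio of the binding box) are purely illustrative assumptions, as are the function names and the binary-mask input format.

```python
def binding_box(mask):
    """Return (top, left, bottom, right) of the tightest box around the object.

    `mask` is a nested list where nonzero entries belong to the object.
    """
    rows = [i for i, row in enumerate(mask) if any(row)]
    cols = [j for j in range(len(mask[0])) if any(row[j] for row in mask)]
    return min(rows), min(cols), max(rows), max(cols)


def feature_vector(mask):
    """Derive a coarse shape descriptor from the binding box.

    Assumption: aspect ratio and fill ratio stand in for whatever
    elements a real descriptor would carry.
    """
    top, left, bottom, right = binding_box(mask)
    height = bottom - top + 1
    width = right - left + 1
    area = sum(1 for row in mask for p in row if p)
    return {"aspect_ratio": width / height, "fill": area / (width * height)}


def best_matches(features_by_id, element, query, k=3):
    """Rank stored objects by how close one feature element is to the query."""
    ranked = sorted(features_by_id,
                    key=lambda oid: abs(features_by_id[oid][element] - query))
    return ranked[:k]
```

Because only one scalar per object is compared against the query number, matching is a sort over precomputed values rather than any pixel-level comparison.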
Abstract:
A system and method of coding (encoding and/or decoding) video content to extend file formats for storage. The system and method define additional sample group description entries. By way of example, the method can comprise the steps of: (1) receiving a file with encoded media data as a scalable video codec stream; (2) extracting information identifying the various spatial resolutions, temporal resolutions, quality resolutions or combinations of spatio-temporal-quality resolutions from the media data; (3) generating new description entries and a dependency grouping box; (4) populating boxes with extracted metadata; and (5) incorporating metadata into a file associated with the media data using a specific media file format.
Abstract:
A method and apparatus for organizing data pertaining to audiovisual content are described. According to one embodiment, an exemplary method for organizing data pertaining to audiovisual content includes defining at least one descriptive list for a descriptive portion of the data pertaining to audiovisual content, defining at least one accessing list for an accessing portion of the data pertaining to audiovisual content, and generating a matrix that connects the accessing list to the descriptive list.