Abstract:
A method of representing and analysing images comprises producing a plurality of descriptors of an image at one or more scales and for one or more colour channels, said descriptors capturing colour content and interrelation information within the regions, and associating the descriptors in a plurality of ways based on their characteristics such as scale, colour channel, feature semantics, and region, and comparing such representations of images to assess the similarity of images.
Abstract:
A method of detecting a region having predetermined colour characteristics in an image comprises transforming colour values of pixels in the image from a first colour space to a second colour space, using the colour values in the second colour space to determine probability values expressing a match between pixels and the predetermined colour characteristics, where the probability values range over a multiplicity of values, using said probability values to identify pixels at least approximating to said predetermined colour characteristics, grouping pixels which at least approximate to said predetermined colour characteristics, and extracting information about each group, wherein pixels are weighted according to the respective multiplicity of probability values, and the weightings are used when grouping the pixels and/or when extracting information about a group.
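The steps in this abstract (colour-space transform, multi-valued match probabilities, grouping, weighted extraction) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and not taken from the abstract: HSV as the second colour space, a Gaussian fall-off around a hypothetical target hue, 4-connected grouping, and a probability-weighted centroid as the extracted information.

```python
import colorsys
from math import exp


def match_probability(rgb, target_hue=0.05, hue_sigma=0.05):
    """Return a match probability in [0, 1] for an (R, G, B) pixel (0-255).

    Hypothetical model: Gaussian fall-off of the circular hue distance,
    scaled by saturation so that grey pixels never match.
    """
    r, g, b = (c / 255.0 for c in rgb)
    h, s, _ = colorsys.rgb_to_hsv(r, g, b)  # first -> second colour space
    d = abs(h - target_hue)
    d = min(d, 1.0 - d)  # hue is circular
    return exp(-0.5 * (d / hue_sigma) ** 2) * s


def detect_regions(image, threshold=0.5):
    """Group 4-connected pixels whose probability is >= threshold.

    Each group reports its pixel count, summed probability, and a
    centroid (row, column) weighted by the per-pixel probabilities.
    """
    height, width = len(image), len(image[0])
    prob = [[match_probability(px) for px in row] for row in image]
    seen = [[False] * width for _ in range(height)]
    regions = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or prob[y][x] < threshold:
                continue
            seen[y][x] = True
            stack = [(y, x)]
            count, total, sum_y, sum_x = 0, 0.0, 0.0, 0.0
            while stack:
                cy, cx = stack.pop()
                p = prob[cy][cx]
                count += 1
                total += p
                sum_y += p * cy
                sum_x += p * cx
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and prob[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            regions.append({
                "size": count,
                "weight": total,
                "centroid": (sum_y / total, sum_x / total),
            })
    return regions
```

Because the probabilities are kept as weights instead of being reduced to a yes/no mask, pixels that match strongly pull the centroid more than pixels that barely pass the threshold.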
Abstract:
A method and apparatus for coding and decoding the fingerprint of a multimedia item such as video or audio is disclosed. A multimedia content temporal interval, such as a video segment or audio segment, is described by a coarse fingerprint and a plurality of fine fingerprints, each fine fingerprint corresponding to a temporal sub-interval of said temporal interval, said temporal sub-interval typically being smaller than said temporal interval. One or more fine fingerprints are encoded in a non-predictive way, with no reference to the temporally neighboring signatures, and one or more fine fingerprints are encoded in a predictive way, from the temporally neighboring signatures. The predictive encoding entails computing the difference between neighboring fine fingerprints to make up a prediction difference matrix, scanning said prediction difference matrix into a one-dimensional vector by vectorising along rows or along columns or along diagonals or along any suitable scanning pattern, and performing lossless encoding on the one-dimensional vector by an appropriate method, preferably selected, at least in part, based on the scanning method used.
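The predictive path described above can be sketched as follows. The choices here are assumptions for illustration only: fine fingerprints are equal-length lists of small integers, the first fingerprint is stored as-is (the non-predictive case), the difference matrix is scanned column by column, and run-length coding stands in for the unspecified lossless method.

```python
from itertools import groupby


def encode_segment(fingerprints):
    """Encode a list of equal-length integer fingerprints.

    Returns the first fingerprint verbatim, the number of difference
    rows, and (value, run_length) pairs for the scanned differences.
    """
    key = list(fingerprints[0])
    # Prediction difference matrix: one row per pair of neighbours.
    diff = [[cur - prev for prev, cur in zip(a, b)]
            for a, b in zip(fingerprints, fingerprints[1:])]
    # Column scan: follow each fingerprint element through time.
    vector = [row[c] for c in range(len(key)) for row in diff]
    runs = [(value, len(list(group))) for value, group in groupby(vector)]
    return key, len(diff), runs


def decode_segment(key, n_rows, runs):
    """Invert encode_segment and return the original fingerprints."""
    vector = [value for value, length in runs for _ in range(length)]
    n_cols = len(key)
    diff = [[vector[c * n_rows + r] for c in range(n_cols)]
            for r in range(n_rows)]
    fingerprints = [list(key)]
    for row in diff:
        fingerprints.append([a + d for a, d in zip(fingerprints[-1], row)])
    return fingerprints
```

When neighbouring fingerprints change slowly, a column scan tends to produce long runs of zeros, which is one reason the choice of lossless coder can depend on the scanning pattern.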
Abstract:
A method of representing and analysing images comprises producing a plurality of descriptors of an image at one or more scales and for one or more color channels, said descriptors capturing color content and interrelation information within the regions, and associating the descriptors in a plurality of ways based on their characteristics such as scale, color channel, feature semantics, and region, and comparing such representations of images to assess the similarity of images.
Abstract:
A method of representing at least one image comprises deriving at least one descriptor based on colour information and colour interrelation information for at least one region of the image, the descriptor having at least one descriptor element, derived using values of pixels in said region, wherein at least one descriptor element for a region is derived using a non-wavelet transform. The representations may be used for image comparisons.
Abstract:
A method of representing at least one image comprises deriving at least one descriptor based on color information and color interrelation information for at least one region of the image, the descriptor having at least one descriptor element, derived using values of pixels in said region, wherein at least one descriptor element for a region is derived using a non-wavelet transform. The representations may be used for image comparisons.
Abstract:
A method and apparatus for processing a first sequence of images and a second sequence of images to compare the first and second sequences is disclosed. Each image of the first sequence and each image of the second sequence is processed by: (i) processing the image data for each of a plurality of pixel neighbourhoods in the image to generate at least one respective descriptor element for each of the pixel neighbourhoods; and (ii) forming an overall image descriptor from the descriptor elements. Each image in the first sequence is compared with each image in the second sequence by calculating a distance between the respective overall image descriptors of the images being compared. The distances are arranged in a matrix, and the matrix is processed to identify similar images.
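A minimal sketch of the distance-matrix comparison, under assumptions that are not in the abstract: images are 2-D lists of grey levels, each descriptor element is the mean of a non-overlapping block of pixels, the distance is the L1 norm, and similar images are found by a simple threshold on the matrix.

```python
def image_descriptor(image, block=2):
    """Concatenate one element (the block mean) per pixel neighbourhood."""
    height, width = len(image), len(image[0])
    descriptor = []
    for y in range(0, height - block + 1, block):
        for x in range(0, width - block + 1, block):
            total = sum(image[y + dy][x + dx]
                        for dy in range(block) for dx in range(block))
            descriptor.append(total / (block * block))
    return descriptor


def distance_matrix(seq_a, seq_b):
    """Entry [i][j] is the L1 distance between image i of seq_a and image j of seq_b."""
    desc_a = [image_descriptor(img) for img in seq_a]
    desc_b = [image_descriptor(img) for img in seq_b]
    return [[sum(abs(p - q) for p, q in zip(a, b)) for b in desc_b]
            for a in desc_a]


def matching_pairs(matrix, threshold):
    """Return the (i, j) index pairs whose distance is at most threshold."""
    return [(i, j)
            for i, row in enumerate(matrix)
            for j, d in enumerate(row)
            if d <= threshold]
```

Runs of matches along a diagonal of the matrix would indicate a shared sub-sequence; that post-processing step is left out of this sketch.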
Abstract:
A method of classifying pixels in an image comprises combining (a) a method of identifying pixels of a first content type, wherein said method may also falsely identify pixels of a second content type as belonging to said first content type, with (b) a method of distinguishing between pixels of types including said first content type and pixels of types including said second content type, wherein method (b) comprises (c) a method for distinguishing between said first content type and content including said second content type and/or (d) a method for distinguishing between said second content type and content including said first content type.
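The combination of methods (a) and (b) can be read as a two-stage filter. The sketch below is one possible reading, not the claimed method: `reddish` and `too_dark` are hypothetical toy stand-ins for method (a) and for a second-type discriminator in the sense of (d), chosen only so the example runs.

```python
def classify_pixels(pixels, detect_first, looks_like_second):
    """Label a pixel True only if (a) accepts it and (b) does not reject it."""
    labels = []
    for pixel in pixels:
        candidate = detect_first(pixel)  # method (a): may include false positives
        rejected = candidate and looks_like_second(pixel)  # method (b)
        labels.append(candidate and not rejected)
    return labels


def reddish(pixel):
    """Toy method (a): red is the strictly dominant channel."""
    r, g, b = pixel
    return r > g and r > b


def too_dark(pixel):
    """Toy discriminator: treat very dark pixels as the second content type."""
    return max(pixel) < 80
```

The second stage only runs on pixels the first stage accepts, so its job is limited to removing the false positives that method (a) is known to produce.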
Abstract:
A method and apparatus for processing a first sequence of images and a second sequence of images to compare the first and second sequences is disclosed. Each of a plurality of the images in the first sequence and each of a plurality of the images in the second sequence is processed by (i) processing the image data for each of a plurality of pixel neighborhoods in the image to generate at least one respective descriptor element for each of the pixel neighborhoods, each descriptor element comprising one or more bits; and (ii) forming a plurality of words from the descriptor elements of the image such that each word comprises a unique combination of descriptor element bits. The words for the second sequence are generated from the same respective combinations of descriptor element bits as the words for the first sequence. Processing is performed to compare the first and second sequences by comparing the words generated for the plurality of images in the first sequence with the words generated for the plurality of images in the second sequence.
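A minimal sketch of word formation and comparison, with illustrative assumptions: each image descriptor is already available as a flat list of bits, the word definitions are tuples of bit positions shared by both sequences, and the comparison score is simply the number of words that agree at the same position.

```python
def make_words(bits, combos):
    """Pack the bits selected by each combination into one integer word."""
    words = []
    for combo in combos:
        value = 0
        for index in combo:
            value = (value << 1) | bits[index]
        words.append(value)
    return tuple(words)


def matching_word_counts(seq_a, seq_b, combos):
    """Entry [i][j] counts the words shared by image i of seq_a and image j of seq_b.

    The same combos are applied to both sequences, so word k of one image
    is always compared with word k of the other.
    """
    words_a = [make_words(bits, combos) for bits in seq_a]
    words_b = [make_words(bits, combos) for bits in seq_b]
    return [[sum(x == y for x, y in zip(wa, wb)) for wb in words_b]
            for wa in words_a]
```

Comparing short words instead of whole descriptors means two images can still register a partial match when only some descriptor bits differ.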