摘要:
Each image of a set of images is characterized with a set of sparse feature descriptors and a set of dense feature descriptors. In some embodiments, both the set of sparse feature descriptors and the set of dense feature descriptors are calculated based on a fixed rotation for computing texture descriptors, while color descriptors are rotation invariant. In some embodiments, the descriptors of both sparse and dense features are then quantized into visual words. Each database image is represented by a feature index including the visual words computed from both sparse and dense features. A query image is characterized with the visual words computed from both sparse and dense features of the query image. A rotated local Bag-of-Features (BoF) operation is performed upon a set of rotated query images against the set of database images. Each of the set of images is ranked based on the rotated local Bag-of-Features operation.
摘要:
A method of generating training documents for training a classifying device comprises, with a processor, sampling from a distribution of words in a number of original documents, and creating a number of pseudo-documents from the distribution of words, the pseudo-documents comprising a similar distribution of words as the original documents. A device for classifying textual documents comprises a processor; and a memory communicatively coupled to the processor, the memory comprising a sampling module to, when executed by the processor, determine the distribution of words in a number of original documents, a pseudo-document creation module to, when executed by the processor, create a number of pseudo-documents from the distribution of words, the pseudo-documents comprising a similar distribution of words as the original documents, and a training module to, when executed by the processor, train the device to classify textual documents based on the pseudo-documents.
摘要:
A system for comprehensively and effectively acquiring, as an interest graph, targets and regions of interest unique to a user is provided. A system according to the invention is a search system using as input means image information containing various objects and subjects, in which by querying an image recognition engine on the server side via a network about a selected image, the image recognition engine extracts and recognizes in real time various generic objects and the like contained in the specified image, and notifies a relevance search engine on the server side of recognized image components contained in the input image, and the relevance search engine extracts related elements for the individual image components, and visually presents a relevance graph with the extracted related elements as nodes together with the depths of relationships between the nodes.
摘要:
Systems and methods for scoring similarity count a number of matching visual words between a query image and a candidate image, generate an image similarity score for the query image and the candidate image based at least in part on the number of matching visual words, and generate a normalized image similarity score based at least in part on the image similarity score and one or more of a complexity of the query image and a complexity of the candidate image.
摘要:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining image search results. One of the methods includes generating a plurality of feature vectors for each image in a collection of images, wherein each feature vector is associated with an image tile of an image, wherein each feature vector corresponds to one of a plurality of predetermined visual words. All images in the collection of images that share at least a threshold number of matching visual words associated with matching image tiles are classified as near-duplicate images.
摘要:
A method comprises computing a color factor value indicating an amount of color gradients in at least one color channel from the query image. The method comprises combining the color-keypoints with the gray-keypoints when the color factor value is greater than a threshold. A method for performing a visual search comprises extracting a plurality of local descriptors from a query image and then selecting a subset of them based on various criteria's such as visual meaning score. A method comprises aggregating each mean vector for each visual codeword from distances between each visual codeword and local descriptors. The method comprises aggregating variance vector for each visual codeword from the distance between each visual codeword, and local descriptors. The method comprises transmitting aggregated mean vector information and aggregated variance vector information to a search server for efficient image retrieval.
摘要:
A method for presenting digital images having a high interest level to a particular person selected from a set of candidate digital images. The candidate digital image are analyzed to designate one or more image elements, and familiarity levels are determined of the designated image elements to the particular person. For each candidate digital image, an associated interest level to the particular person is determined responsive to the determined familiarity levels. One or more of the candidate digital images are selected based on the determined interest levels and are presented to the particular person.
摘要:
The present disclosure discloses a method and apparatus for creating a sample image index table, filtering image, and searching image, to improve accuracy of monitoring images. A method for image filtering comprises: establishing a sample image index table; extracting regional characteristics from an image to be searched; clustering the regional characteristics of the image to be searched into corresponding nodes; obtaining a corresponding sample image identification by indexing the sample image index table using node identifications of the nodes of the image to be searched; determining a number of duplicate nodes between the image to be searched and the sample image; obtaining a degree of similarity of the image to be searched based on a number of the nodes of the image to be searched and a number of the nodes of the sample image; and filtering out the image to be searched when a degree of similarity between the image to be searched and the sample image exceeds a similarity threshold.
摘要:
Determining ranked candidate media in response to media query data corresponding to a query media includes receiving the media query data including feature data of the query media, coordinate data, and boundary data, matching the features with corresponding features of an media database using the feature data to identify features in the media database within a predetermined hamming distance in a hash table from the corresponding features of the query media to obtain matched features in the media database, determining candidate media whose number of matched features exceeds a matched feature threshold, generating a geometry similarity score between the query media and each candidate media using the feature data and the coordinate data, generating a boundary similarity score between the query media and each candidate media using the boundary data, ranking the candidate media based on the numbers of matched features, the geometry similarity scores and the boundary similarity scores.
摘要:
A method for determining a semantic concept classification for a digital video clip based on a grouplet dictionary that includes a plurality of temporally-correlated grouplets. The temporally-correlated grouplets include textual codewords and either visual codewords or audio codewords, wherein the codewords in a particular temporally-correlated grouplet were determined to be correlated with each other based on analysis of a set of training videos. Reference video codeword similarity scores are determined for a set of reference video clips, and codeword similarity scores are determined for the digital video clip. A reference video similarity score is determined for each reference video clip representing a similarity between the digital video clip and the reference video clip based on the reference video codeword similarity scores, the codeword similarity scores, and the temporally-correlated grouplets. One or more semantic concept classifications are determined using trained semantic classifiers responsive to the determined reference video similarity scores.