Abstract:
This invention relates to building a landmark database from web data. In one embodiment, a computer-implemented method builds a landmark database. Web data including a web page is received from one or more websites via one or more networks. The web data is interpreted using at least one processor to determine landmark data describing a landmark. At least a portion of the landmark data identifies a landmark. Finally, a visual model is generated using the landmark data. A computing device is able to recognize the landmark in an image based on the visual model.
Abstract:
Methods, systems and apparatus for identifying modified images based on visual dissimilarity to a first image. In an aspect, a method includes determining, for each of a first image and a second image, a respective set of local image feature descriptions; determining one or more unmatched regions of the images that include unmatched image features and that correspond to one or more same respective regions in both the first image and the second image; determining, for each of the one or more unmatched regions of the images, a modification measure based on the image data corresponding to the unmatched region in the first image and the image data corresponding to the unmatched region in the second image; and determining that the second image is a modification of the first image when one of the modification measures meets a modification measure threshold.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for creating an image similarity model. In one aspect, a method includes obtaining feature vectors for images in a set of images, and determining first similarity measures for unlabeled images relative to a reference image. The first similarity measures are independent of first similarity feedback between the unlabeled images and the reference image. The unlabeled images are ranked based on the first similarity measures, and a weighted feature vector is generated based, in part, on the ranking. Second similarity measures are determined, independent of second similarity feedback, for labeled images and a second reference image. The labeled images are ranked based on the second similarity measures. The weighted feature vector is adjusted based, in part, on a comparison of the ranking to a second ranking of the labeled images that is based on the second similarity feedback.
Abstract:
Methods, systems and apparatus for identifying modified images based on seed images that are known to be modified images. In an aspect, a method includes accessing data identifying a set of first seed images; for each first seed image, determining a respective first set of similar images from images in an image corpus, each similar image having a visual similarity score that is a measure of visual similarity of the similar image to the first seed image based on the image content of the similar image and the first seed image that satisfies a first seed image similarity threshold; and for each similar image in each respective first set of similar images, attributing to the similar image signal data of each first seed image for which the similar image has a respective visual similarity score satisfying the first seed image similarity threshold.
Abstract:
A computer implemented technique for presenting selected image search results is presented. The technique includes obtaining a first query at a first time and obtaining a first set of image search results responsive to the first query. The technique also includes providing the first set of image search results in response to the first query and obtaining input data reflecting a selection of at least one of the first set of image search results. The technique further includes obtaining a second query at a second time subsequent to the first time and obtaining a second set of image search results responsive to the second query. The technique further includes providing the second set of image search results together with the selected at least one of the first set of image search results.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for creating an image similarity model. In one aspect, a method includes obtaining feature vectors for images in a set of images, and determining first similarity measures for unlabeled images relative to a reference image. The first similarity measures are independent of first similarity feedback between the unlabeled images and the reference image. The unlabeled images are ranked based on the first similarity measures, and a weighted feature vector is generated based, in part, on the ranking. Second similarity measures are determined, independent of second similarity feedback, for labeled images and a second reference image. The labeled images are ranked based on the second similarity measures. The weighted feature vector is adjusted based, in part, on a comparison of the ranking to a second ranking of the labeled images that is based on the second similarity feedback.
Abstract:
Methods, systems and apparatus for identifying modified images based on seed images that are known to be modified images. In an aspect, a method includes accessing data identifying a set of first seed images; for each first seed image, determining a respective first set of similar images from images in an image corpus, each similar image having a visual similarity score that is a measure of visual similarity of the similar image to the first seed image based on the image content of the similar image and the first seed image that satisfies a first seed image similarity threshold; and for each similar image in each respective first set of similar images, attributing to the similar image signal data of each first seed image for which the similar image has a respective visual similarity score satisfying the first seed image similarity threshold.
Abstract:
This invention relates to building a landmark database from web data. In one embodiment, a computer-implemented method builds a landmark database. Web data including a web page is received from one or more websites via one or more networks. The web data is interpreted using at least one processor to determine landmark data describing a landmark. At least a portion of the landmark data identifies a landmark. Finally, a visual model is generated using the landmark data. A computing device is able to recognize the landmark in an image based on the visual model.
Abstract:
Methods, systems, and apparatus, for determining fine-grained image similarity. In one aspect, a method includes training an image embedding function on image triplets by selecting image triplets of first, second and third images; generating, by the image embedding function, a first, second and third representations of the features of the first, second and third images; determining, based on the first representation of features and the second representation of features, a first similarity measure for the first image to the second image; determining, based on the first representation of features and the third representation of features, a second similarity measure for the first image to the third image; determining, based on the first and second similarity measures, a performance measure of the image embedding function for the image triplet; and adjusting the parameter weights of the image embedding function based on the performance measures for the image triplets.
Abstract:
Systems and methods are provided herein relating to video classification. A text mining component is disclosed that automatically generates a plurality of video event categories. Part-of-Speech (POS) analysis can be applied to video titles and descriptions, further using a lexical hierarchy to filter potential classifications. Classification performance can be further improved by extracting content-based features from a video sample. Using the content based features a set of classifier scores can be generated. A hyper classifier can use both the classifier scores and the content-based features of the video to classify the video sample.