Abstract:
A method is described that includes producing an audio spectrogram from a target sample, generating a number of fingerprints based on the audio spectrogram, comparing the series of fingerprints to samples in a data repository using wavelet coefficients, and identifying the target sample based on the matches found in the data repository.
Abstract:
An image similar to a target image is selected from among a set of candidate images. A set of image classifiers is first generated and used to create a fingerprint for each candidate image. A hash table is generated for each fingerprint segment, and an identifier for each candidate image is stored in each hash table based on the candidate image fingerprint value for the fingerprint segment associated with the hash table. A fingerprint is created for the target image using the set of classifiers. Segments of the target image fingerprints are compared to segments of the candidate image fingerprints using the hash table, and a candidate image similar to the target image is selected based on this comparison.
Abstract:
Compression of an image is performed based on prediction of target blocks of an image from candidate source blocks of the image. Heuristics are used for identifying the candidate source blocks, for example, source blocks are selected from within a cluster of similar blocks obtained by K-means clustering. For each target block, a region adjacent to the target block is identified and a set of candidate source blocks along with candidate source regions adjacent to the candidate source blocks are identified. The candidate source regions are ranked based on the differences between the candidate source regions and the target source region. Each candidate source block is described using its rank and residual information describing differences between the candidate source block and the target block. The candidate source block that can be described using a minimum amount of information is selected for predicting the target block.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including a method that comprises: determining a target geographic feature that has insufficient targeting information associated therewith, the target geographic feature defining a location; determining one or more similar geographic features to the target geographic feature, each geographic feature including targeting information, the determining based on shared excess queries that are shared between geographic features that are determined to be similar to the target geographic feature, wherein each excess query is a query associated with and exceeds an expected query count for each of the similar geographic features and the target geographic feature; attributing targeting information associated with at least one of the one or more similar geographic features to the target geographic feature; and serving content responsive to queries that relate to the target geographic feature based at least in part on the attributed targeting information.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including a method that comprises: determining a target geographic feature that has insufficient targeting information associated therewith, the target geographic feature defining a location; determining one or more similar geographic features to the target geographic feature, each geographic feature including targeting information, the determining based on shared excess queries that are shared between geographic features that are determined to be similar to the target geographic feature, wherein each excess query is a query associated with and exceeds an expected query count for each of the similar geographic features and the target geographic feature; attributing targeting information associated with at least one of the one or more similar geographic features to the target geographic feature; and serving content responsive to queries that relate to the target geographic feature based at least in part on the attributed targeting information.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer-readable storage medium, including a method for ranking content using location-based query log analysis. The method comprises: identifying a region defining an area of interest including identifying a plurality of content items that are associated with the region; evaluating query logs associated with users that submitted queries associated with the region to determine a ranking associated with the plurality of content items; receiving a request for content associated with the region; and providing one or more of the content items based at least in part on the ranking.
Abstract:
Methods, systems, and apparatus, including computer program products, for ranking images are disclosed. An image search subsystem generates an adjustment factor representative of a quality measure of an image relative to a search query. The quality represents a relevance of the image to the query. The adjustment factor can be computed based on relevance data for the image to the query and image similarity data representing a relative similarity between the image and other images relevant to the query. The relevance data can be based on user actions in response to the image being included in search results for the query. The adjustment factor can be scaled based on whether the relevance data and the image similarity data both indicate that the image is relevant to the search query. A relevance score is computed based on the adjustment factor (e.g., a product of the adjustment factor and relevance score).
Abstract:
Systems, methods, devices, and computer program products provide social and interactive applications for mass media based on real time ambient-audio and/or video identification. In some implementations, a method includes: receiving descriptors identifying ambient audio associated with a media broadcast; comparing the descriptors to one or more reference descriptors; and determining a rating for the media broadcast based at least in part on the results of the comparison.
Abstract:
Systems, methods, devices, and computer program products provide social and interactive applications for mass media based on real time ambient-audio and/or video identification. In some implementations, a method includes: receiving descriptors identifying ambient audio associated with a media broadcast; comparing the descriptors to one or more reference descriptors; and determining a rating for the media broadcast based at least in part on the results of the comparison.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting user interests. In one aspect, a method includes training a prediction model to generate a category prediction of a next query from a set of queries, the category prediction specifying categories to which a next query belongs, the prediction model comprising a plurality of prediction functions that each generate a category prediction of a next query, wherein the prediction functions include two or more of a time-based prediction functions that generate a category prediction based on the category data and a difference of submission times of the queries, a rank-based prediction functions that generate a category prediction based on the category data and a rank order by which the queries were received, and a category-based prediction function that generates a category prediction based on the category data.