摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer-readable storage medium for transcribing information. A method includes: identifying a telephone number that once dialed has an associated message that is played or includes a response system; transcribing the message or information about the response system; storing the transcribed message or information in association with the telephone number in database; receiving a request from a user that includes the telephone number; and providing information about the transcribed information to the user.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including a method that comprises: determining excess queries for a target geographic feature, where the geographic feature defines a location; determining one or more candidate geographic features that have similar excess queries, but displaced in time; determining a time offset between the target geographic feature and a candidate geographic feature based on the displacement in time of the similar excess queries; and targeting content to the candidate geographic feature using the time offset and based on content targeted to the target geographic feature.
摘要:
Systems and methods of evaluating information via a computer network are provided. A content group can be identified, and each item of the content group can be associated with a vector indicating at least one user interest category of users exposed to the item. The vectors of each item can be evaluated to generate a first nearest neighbor list of each item of the content group. The nearest neighbor list of a first item can be compared with the nearest neighbor list of a second item. Based on a result of the comparison, the first and second items can be associated in a cluster.
摘要:
An exemplar dictionary is built from exemplars of digital content for determining predictor blocks for encoding and decoding digital content. The exemplar dictionary organizes the exemplars as clusters of similar exemplars. Each cluster is mapped to a label. Machine learning techniques are used to generate a prediction model for predicting a label for an exemplar. The exemplar dictionary is used to encode digital content. Clusters of exemplars are obtained by applying a prediction model to a target block of digital content for encoding. A predictor block is selected for encoding the target block based on frequency of occurrence of exemplars in the clusters. The target block is encoded using the predictor block.
摘要:
Methods, systems, and computer program products, including computer programs encoded on a computer readable storage medium, for providing content to a user based on the mode of the user. A method includes: identifying a user for targeting content; evaluating usage information for the user to determine targeting information for a plurality of modes associated with the user; receiving a request to deliver content to the user including an identifier associated with the user and information to determine which mode of the plurality of modes the user is operating in; and providing content to the user based on the mode and associated targeting information.
摘要:
Methods, systems, and apparatus, including computer program products, for ranking images are disclosed. An image search subsystem generates an adjustment factor representative of a quality measure of an image relative to a search query. The quality represents a relevance of the image to the query. The adjustment factor can be computed based on relevance data for the image to the query and image similarity data representing a relative similarity between the image and other images relevant to the query. The relevance data can be based on user actions in response to the image being included in search results for the query. The adjustment factor can be scaled based on whether the relevance data and the image similarity data both indicate that the image is relevant to the search query. A relevance score is computed based on the adjustment factor (e.g., a product of the adjustment factor and relevance score).
摘要:
Methods, systems, and computer program products are provided for mapping keywords to geographic features. One example method includes identifying location keywords for each of a multitude of granular locations, determining a feature size for grouping granular locations over an area of interest, determining geo data for one or more features, locating all granular locations which are associated with a given feature using the geo data and forming a set of granular locations per feature, aggregating the location keywords for each granular location in a set forming a keyword mapping for the given feature, receiving an indication of a geographic location that is proximate to a user or is of interest to the user, determining a geographic feature associated with the geographic location, and targeting content for delivery to the user based at least in part on the keyword mapping.
摘要:
A computer-implemented method is disclosed for generating a signature representing an input bit vector. A signature generator generates a primary min-hash value based on a primary permutation from a sequence of permutation blocks. If the primary min-hash value is lower than a threshold value, a secondary min-hash value is generated based on a secondary permutation from the same permutation block. The signature generator then determines one or more signature values based on the primary min-hash value, the secondary min-hash value or both. The one or more signature values are stored as elements of the signature.
摘要:
A method is described that includes producing an audio spectrogram from a target sample, generating a number of fingerprints based on the audio spectrogram, comparing the series of fingerprints to samples in a data repository using wavelet coefficients, and identifying the target sample based on the matches found in the data repository.
摘要:
In general, the subject matter described in this specification can be embodied in methods, systems, and program products. A plurality of electronic training images that are each classified as displaying substantially pictures is obtained. A plurality of local image features in each of the plurality of electronic training images is identified. A plurality of weak classifiers are recursively applied to the local image features. During each iteration a weak classifier that accurately classifies the local images features is selected. After each selection of a weak classifier features that were misclassified by the selected weak classifier are given greater weight than features that were classified correctly by the selected weak classifier. For each selected weak classifier a hillclimbing algorithm is performed to attempt to improve the weak classifier. A strong classifier that is a weighted combination of the selected weak classifiers on which hillclimbing algorithms have been performed is produced.