摘要:
This disclosure relates to audio identification using ordinal transformations. A media matching component receives a sample audio file. The sample audio file can include, for example, a cover song. The media matching component includes a vector component that computes a set of vectors using auditory feature values included in the sample audio file. A hashing component employs a hash function to generate a fingerprint, including a set of sub-fingerprints, for the sample audio file using the set of vectors. The fingerprint is invariant to variations including but not limited to variations in key, instrumentation, encoding formats, performers, performance conditions, arrangement, and/or recording and processing variations. An identification component determines if any reference audio files are similar to the sample audio file using the fingerprint and/or sub-fingerprints, and identifies any similar reference audio files.
摘要:
Methods, systems and articles of manufacture for identifying semantic nearest neighbors in a feature space are described herein. A method embodiment includes generating an affinity matrix for objects in a given feature space, wherein the affinity matrix identifies the semantic similarity between each pair of objects in the feature space, training a multi-bit hash function using a greedy algorithm that increases the Hamming distance between dissimilar objects in the feature space while minimizing the Hamming distance between similar objects, and identifying semantic nearest neighbors for an object in a second feature space using the multi-bit hash function. A system embodiment includes a hash generator configured to generate the affinity matrix and train the multi-bit hash function, and a similarity determiner configured to identify semantic nearest neighbors for an object in a second feature space using the multi-bit hash function.
摘要:
Systems and methods for measuring consistency between two objects based upon a rank of object elements instead of based upon the values of those object elements. Objects being compared can be represented by d-dimension feature vectors, U and V, where each dimension includes an associated value. U and V can be converted to rank vectors, P and Q, where values of U and V dimensions are replaced by an ordered rank or a function thereof. Analysis directed to the consistency between U and V can be accomplished by determining consistency between P and Q, which can be more efficient and more accurate, particularly with regard to illumination-invariant comparisons.
摘要:
A system and method detects matches between portions of video content. A matching module receives an input video fingerprint representing an input video and a set of reference fingerprints representing reference videos in a reference database. The matching module compares the reference fingerprints and input fingerprints to generate a list of candidate segments from the reference video set. Each candidate segment comprises a time-localized portion of a reference video that potentially matches the input video. A classifier is applied to each of the candidate segments to classify the segment as a matching segment or a non-matching segment. A result is then outputted identifying a matching portion of a reference video from the reference video set based on the segments classified as matches.
摘要:
A video hosting service automatically identifies, in a video database, a set of videos associated with an advertiser, and presents the identified videos to the advertiser for consideration. The videos may be selected based on analysis of their video content for images of logos associated with the advertisers. The video hosting service may then receive from the advertisers a listing of which of the presented videos should be given an award.
摘要:
The subject matter of this specification can be embodied in, among other things, a method that includes inferring labels for videos, users, advertisements, groups of users, and other entities included in a social network system. The inferred labels can be used to generate recommendations such as videos or advertisements in which a user may be interested. Inferred labels can be generated based on social or other relationships derived from, for example, profiles or activities of social network users. Inferred labels can be advantageous when explicit information about these entities is not available. For example, a particular user may not have clicked on any online advertisements, so the user is not explicitly linked to any advertisements.
摘要:
The subject matter of this specification can be embodied in, among other things, a method that includes determining, for a portion of users of a social network, label values each comprising an inferred interest level of a user in a subject indicated by a label, associating a first user with one or more second users based on one or more relationships specified by the first user, and outputting a first label value for the first user based on one or more second label values of the one or more second users.
摘要:
A method, a system and a computer program product generate a statistical classification model used by a computer system to determine a class associated with an unlabeled time series event.
摘要:
The disclosed embodiments describe a method, an apparatus, an application specific integrated circuit, and a server that provides a fast and efficient look up for data analysis. The apparatus and server may be configured to obtain data segments from a plurality of input devices. The data segments may be individual unique subsets of the entire data set obtained by a plurality input devices. A hash function may be applied to an aggregated set of the data segments. A result of the hash function may be stored in a data structure. A codebook may be generated from the hash function results.
摘要:
A method includes identifying a named entity, retrieving images associated with the named entity, and using a face detection algorithm to perform face detection on the retrieved images to detect faces in the retrieved images. At least one representative face image from the retrieved images is identified, and the representative face image is used to identify one or more additional images representing the at least one named entity.