摘要:
Systems and methods are provided herein relating to audio matching. Both melody fingerprints and audio-id fingerprints can be used to improve an audio matching system's resistance to pitch shifts. A melody fingerprint can be used to identify a set of potential melody matches. Varying pitch shifted audio-id reference fingerprints can be generated for audio-id fingerprints associated with the potential matches identified in melody matching. Additional pitch shifted audio-id fingerprints of a reference sample are generated and used in matching only if an audio sample has previously been matched to a melody fingerprint of the same reference sample. A reference index need not be expanded to include pitch shifted variations of each reference sample as pitch shifted variations of audio-id fingerprint reference samples are generated and used only if their associated melody fingerprint is deemed a potential match.
摘要:
Systems and methods described herein can assign a confidence score to a match of unstructured descriptive information with structured reference information in a reference database. The systems and methods can take into account the structured nature of the reference information in assigning the score, thereby facilitating increased confidence in the match, and consequently, facilitating improved database organization and content identification.
摘要:
A combined fingerprint is generated for a video that can match two near-identical videos that differ only in their aspect ratios or formats. A transformation strategy is selected by selecting a first and a second aspect correction method. A first transformed video is generated by applying the first aspect correction method to the video. A second transformed video is generated by applying the second aspect correction method to the video. A first fingerprint is generated using the first transformed video. A second fingerprint is generated using the second transformed video. The combined fingerprint is generated by combining the first half of the first fingerprint with the second half of the second fingerprint.
摘要:
A system and method for evaluating claims is disclosed. The system comprises a selection module, a query module, a communication module and a determination module. The selection module determines a review set including one or more claims based at least in part on claim data. The query module determines, based at least in part on the review set, an evaluation form including one or more queries associated with a first claim. The communication module receives answer data describing one or more answers responsive to the one or more queries included in the evaluation form. The determination module determines a validity decision associated with the first claim based at least in part on the answer data.
摘要:
Systems and methods are provided herein relating to audio matching. A compact digest can be generated based on sets of triples, where triples are groupings of three interest points that meet threshold criteria. The compact digest can be used in identifying a potential audio match. A full digest can then be used in verifying the potential match. By using a compact digest to perform audio matching, the audio matching system can be scaled to encompass millions or billions of reference audio samples while still using the full digest to maintain accuracy.
摘要:
Systems and methods audio matching using interest point overlap are disclosed herein. The systems include determining at least one matching reference segment based on a probe segment. Interest points for both the at least one matching reference segment and the probe segment can be generated. Probe segment interest points and matching reference segment interest points can be time aligned and frequency aligned. A count can be generated based on a number of overlapping interest points between each set of reference interest points and the set of probe segment interest points. The disclosed systems and methods allow false positive reference to be identified and eliminated based on the count. The benefits in eliminating false positive matches improve the accuracy of an audio matching system.
摘要:
Systems and methods are provided herein relating to audio matching. Adaptive weighting of popular reference content can be used to more efficiently allocate space in a weighted reference index used to match audio signals. An audio reference index can be maintained that contains a set of audio references wherein each audio reference in the set of audio references is associated with a score. A weighted reference index can be generated based on the audio reference index and the score associated with each audio reference wherein respective audio references are up-weighted or up-scored based at least in part of user popularity. The benefits in using adaptive weighting of popular reference content can improve the accuracy of an audio matching system.
摘要:
Systems and methods are provided herein relating to real-time duplicate detection of video content. Fingerprints can be generated for an uploaded video. The fingerprints can be used to match the uploaded video to a set of matching videos. The set of matching videos can be filtered based on the type of match, and the quality of the match. A unique cluster-id can be generated for the uploaded video containing an upload time, and that unique cluster-id can then be modified to associate the uploaded video with a cluster-id of potential duplicates. Cluster-ids can then be used in the context of a search to filter results that have identical cluster-ids. The benefits in using real-time duplicate detection can better maximize user experiences in a video sharing service that contains potential duplicates of the same content.
摘要:
This disclosure relates to transformation invariant media matching. A fingerprinting component can generate a transformation invariant identifier for media content by adaptively encoding the relative ordering of interest points in media content. The interest points can be grouped into subsets, and stretch invariant descriptors can be generated for the subsets based on ratios of coordinates of interest points included in the subsets. The stretch invariant descriptors can be aggregated into a transformation invariant identifier. An identification component compares the identifier against a set of identifiers for known media content, and the media content can be matched or identified as a function of the comparison.
摘要:
Large-scale matching of videos is performed by matching a set of probe videos against a set of reference videos to determine if they are visually and/or aurally similar. The visual and audio fingerprints of all probe videos and reference videos are divided into subfingerprints, which are divided into LSH bands. The LSH bands of the probe videos are sorted in one list, and the LSH bands of the reference videos are sorted in another list. Then, the two sorted lists are linearly scanned for matching LSH bands. The matching LSH bands are sorted by probe video ID, and each probe video ID is searched to find matches between probe videos and reference videos. Further, an incremental matching process identifies matches as groups of new probe videos and/or new reference videos are added, without unnecessary repetition of matching old probe videos against old reference videos.