摘要:
Systems and methods are provided herein relating to audio matching. A compact digest can be generated based on sets of triples, where triples are groupings of three interest points that meet threshold criteria. The compact digest can be used in identifying a potential audio match. A full digest can then be used in verifying the potential match. By using a compact digest to perform audio matching, the audio matching system can be scaled to encompass millions or billions of reference audio samples while still using the full digest to maintain accuracy.
摘要:
This disclosure relates to transformation invariant media matching. A fingerprinting component can generate a transformation invariant identifier for media content by adaptively encoding the relative ordering of interest points in media content. The interest points can be grouped into subsets, and stretch invariant descriptors can be generated for the subsets based on ratios of coordinates of interest points included in the subsets. The stretch invariant descriptors can be aggregated into a transformation invariant identifier. An identification component compares the identifier against a set of identifiers for known media content, and the media content can be matched or identified as a function of the comparison.
摘要:
A combined fingerprint is generated for a video that can match two near-identical videos that differ only in their aspect ratios or formats. A transformation strategy is selected by selecting a first and a second aspect correction method. A first transformed video is generated by applying the first aspect correction method to the video. A second transformed video is generated by applying the second aspect correction method to the video. A first fingerprint is generated using the first transformed video. A second fingerprint is generated using the second transformed video. The combined fingerprint is generated by combining the first half of the first fingerprint with the second half of the second fingerprint.
摘要:
Systems and methods are provided herein relating to audio matching. Both melody fingerprints and audio-id fingerprints can be used to improve an audio matching system's resistance to pitch shifts. A melody fingerprint can be used to identify a set of potential melody matches. Varying pitch shifted audio-id reference fingerprints can be generated for audio-id fingerprints associated with the potential matches identified in melody matching. Additional pitch shifted audio-id fingerprints of a reference sample are generated and used in matching only if an audio sample has previously been matched to a melody fingerprint of the same reference sample. A reference index need not be expanded to include pitch shifted variations of each reference sample as pitch shifted variations of audio-id fingerprint reference samples are generated and used only if their associated melody fingerprint is deemed a potential match.
摘要:
Systems and methods described herein can assign a confidence score to a match of unstructured descriptive information with structured reference information in a reference database. The systems and methods can take into account the structured nature of the reference information in assigning the score, thereby facilitating increased confidence in the match, and consequently, facilitating improved database organization and content identification.
摘要:
Systems and methods audio matching using interest point overlap are disclosed herein. The systems include determining at least one matching reference segment based on a probe segment. Interest points for both the at least one matching reference segment and the probe segment can be generated. Probe segment interest points and matching reference segment interest points can be time aligned and frequency aligned. A count can be generated based on a number of overlapping interest points between each set of reference interest points and the set of probe segment interest points. The disclosed systems and methods allow false positive reference to be identified and eliminated based on the count. The benefits in eliminating false positive matches improve the accuracy of an audio matching system.
摘要:
Systems and methods are provided herein relating to audio matching. Adaptive weighting of popular reference content can be used to more efficiently allocate space in a weighted reference index used to match audio signals. An audio reference index can be maintained that contains a set of audio references wherein each audio reference in the set of audio references is associated with a score. A weighted reference index can be generated based on the audio reference index and the score associated with each audio reference wherein respective audio references are up-weighted or up-scored based at least in part of user popularity. The benefits in using adaptive weighting of popular reference content can improve the accuracy of an audio matching system.
摘要:
This disclosure relates to learning common spelling errors of metadata terms associated with content through content matching, such as content matching using fingerprints.
摘要:
Systems and methods for audio matching are disclosed herein. In one embodiment, a system includes both interest point mixing and fingerprint mixing by using multiple interest point detection methods in parallel. Since multiple interest point detection methods are used in parallel, accuracy of audio matching is improved across a wide variety of audio signals. In addition the scalability of the disclosed audio matching system is increased by matching the fingerprint of an audio sample with a fingerprint of a reference sample versus matching an entire spectrogram. Accordingly, a more accurate and more general solution to audio matching can be accomplished.
摘要:
Systems and methods are provided herein relating to real-time detection of inactive broadcasts during live stream ingestion. Both audio fingerprints and video fingerprints can be dynamically and continuously generated for a live stream ingestion. Sets of video fingerprints and sets of audio fingerprints can be continuously generated based on common successive overlapping time windows. A set of audio fingerprints and a set of video fingerprints can be associated with each time window. Video similarity scores and audio similarity scores can be generates for each time window to determine whether the stream is inactive or static during the time window. Only fingerprints relating to an active broadcast can be indexed in a fingerprint index.