Pitch shift resistant audio matching
    1.
    发明授权
    Pitch shift resistant audio matching 有权
    音高偏移音频匹配

    公开(公告)号:US09052986B1

    公开(公告)日:2015-06-09

    申请号:US13450422

    申请日:2012-04-18

    IPC分类号: G06F17/00

    摘要: Systems and methods are provided herein relating to audio matching. Both melody fingerprints and audio-id fingerprints can be used to improve an audio matching system's resistance to pitch shifts. A melody fingerprint can be used to identify a set of potential melody matches. Varying pitch shifted audio-id reference fingerprints can be generated for audio-id fingerprints associated with the potential matches identified in melody matching. Additional pitch shifted audio-id fingerprints of a reference sample are generated and used in matching only if an audio sample has previously been matched to a melody fingerprint of the same reference sample. A reference index need not be expanded to include pitch shifted variations of each reference sample as pitch shifted variations of audio-id fingerprint reference samples are generated and used only if their associated melody fingerprint is deemed a potential match.

    摘要翻译: 本文提供了与音频匹配有关的系统和方法。 可以使用旋律指纹和音频编码指纹来改善音频匹配系统对音调偏移的阻力。 旋律指纹可用于识别一组潜在的旋律比赛。 可以为与旋律匹配中识别的潜在匹配相关联的音频ID指纹生成不同的音调移位音频参考指纹。 生成参考样本的附加音调移位音频ID指纹,并且仅在音频样本先前已经匹配到相同参考样本的旋律指纹的情况下才被匹配。 参考索引不需要扩展为包括每个参考样本的音调偏移变化,因为仅当音频ID指纹参考样本的相关旋律指纹被认为是潜在匹配时才产生音频ID指纹参考样本的音调偏移变化。

    Full digest of an audio file for identifying duplicates
    3.
    发明授权
    Full digest of an audio file for identifying duplicates 有权
    用于识别重复的音频文件的完整摘要

    公开(公告)号:US08953811B1

    公开(公告)日:2015-02-10

    申请号:US13450427

    申请日:2012-04-18

    摘要: Systems and methods are provided herein relating to audio matching. A compact digest can be generated based on sets of triples, where triples are groupings of three interest points that meet threshold criteria. The compact digest can be used in identifying a potential audio match. A full digest can then be used in verifying the potential match. By using a compact digest to perform audio matching, the audio matching system can be scaled to encompass millions or billions of reference audio samples while still using the full digest to maintain accuracy.

    摘要翻译: 本文提供了与音频匹配有关的系统和方法。 可以基于三元组的集合生成紧凑的摘要,其中三元组是满足阈值标准的三个兴趣点的分组。 紧凑的摘要可用于识别潜在的音频匹配。 然后可以使用完整的摘要来验证潜在的匹配。 通过使用紧凑的摘要来执行音频匹配,音频匹配系统可以缩放以涵盖数百万或数十亿的参考音频样本,同时仍然使用完整的摘要来保持准确性。

    Audio matching using time alignment, frequency alignment, and interest point overlap to filter false positives
    4.
    发明授权
    Audio matching using time alignment, frequency alignment, and interest point overlap to filter false positives 有权
    使用时间对齐,频率对齐和兴趣点重叠的音频匹配来过滤假阳性

    公开(公告)号:US09268845B1

    公开(公告)日:2016-02-23

    申请号:US13415786

    申请日:2012-03-08

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3074 G06F17/30743

    摘要: Systems and methods audio matching using interest point overlap are disclosed herein. The systems include determining at least one matching reference segment based on a probe segment. Interest points for both the at least one matching reference segment and the probe segment can be generated. Probe segment interest points and matching reference segment interest points can be time aligned and frequency aligned. A count can be generated based on a number of overlapping interest points between each set of reference interest points and the set of probe segment interest points. The disclosed systems and methods allow false positive reference to be identified and eliminated based on the count. The benefits in eliminating false positive matches improve the accuracy of an audio matching system.

    摘要翻译: 本文公开了使用兴趣点重叠的系统和方法音频匹配。 所述系统包括基于探测段来确定至少一个匹配的参考段。 可以生成至少一个匹配参考段和探针段的兴趣点。 探测段利息点和匹配参考段的兴趣点可以进行时间对齐和频率对齐。 可以基于每组参考兴趣点和探头段兴趣点集合之间的重叠的兴趣点的数量来生成计数。 所公开的系统和方法允许基于计数来识别和消除假阳性参考。 消除假阳性匹配的好处可以提高音频匹配系统的准确性。

    Adaptive weighting of popular reference content in audio matching
    5.
    发明授权
    Adaptive weighting of popular reference content in audio matching 有权
    音频匹配中流行参考内容的自适应加权

    公开(公告)号:US09087124B1

    公开(公告)日:2015-07-21

    申请号:US13430134

    申请日:2012-03-26

    IPC分类号: G10L19/00 G06F17/30 H04R29/00

    CPC分类号: G06F17/30743 G06F17/30749

    摘要: Systems and methods are provided herein relating to audio matching. Adaptive weighting of popular reference content can be used to more efficiently allocate space in a weighted reference index used to match audio signals. An audio reference index can be maintained that contains a set of audio references wherein each audio reference in the set of audio references is associated with a score. A weighted reference index can be generated based on the audio reference index and the score associated with each audio reference wherein respective audio references are up-weighted or up-scored based at least in part of user popularity. The benefits in using adaptive weighting of popular reference content can improve the accuracy of an audio matching system.

    摘要翻译: 本文提供了与音频匹配有关的系统和方法。 流行参考内容的自适应加权可用于更有效地分配用于匹配音频信号的加权参考索引中的空间。 可以保持音频参考索引,该索引包含一组音频引用,其中该组音频引用中的每个音频参考与分数相关联。 可以基于音频参考索引和与每个音频参考相关联的分数来生成加权参考索引,其中相应的音频参考基于用户普及度至少部分地被加权或上分数。 使用流行参考内容的自适应加权的好处可以提高音频匹配系统的准确性。

    Transformation invariant media matching
    6.
    发明授权
    Transformation invariant media matching 有权
    转换不变媒体匹配

    公开(公告)号:US08738633B1

    公开(公告)日:2014-05-27

    申请号:US13362905

    申请日:2012-01-31

    IPC分类号: G06F17/30

    摘要: This disclosure relates to transformation invariant media matching. A fingerprinting component can generate a transformation invariant identifier for media content by adaptively encoding the relative ordering of interest points in media content. The interest points can be grouped into subsets, and stretch invariant descriptors can be generated for the subsets based on ratios of coordinates of interest points included in the subsets. The stretch invariant descriptors can be aggregated into a transformation invariant identifier. An identification component compares the identifier against a set of identifiers for known media content, and the media content can be matched or identified as a function of the comparison.

    摘要翻译: 本公开涉及变换不变媒体匹配。 指纹分量可以通过对媒体内容中的兴趣点的相对排序进行自适应编码来生成媒体内容的变换不变标识符。 可以将兴趣点分组为子集,并且可以基于子集中包括的兴趣点坐标的比例为子集生成拉伸不变描述符。 拉伸不变描述符可以聚合成变换不变标识符。 识别部件将标识符与已知媒体内容的一组标识符进行比较,并且媒体内容可以作为比较的函数进行匹配或标识。

    Ensemble interest point detection for audio matching
    8.
    发明授权
    Ensemble interest point detection for audio matching 有权
    音乐匹配的集合兴趣点检测

    公开(公告)号:US09098576B1

    公开(公告)日:2015-08-04

    申请号:US13274725

    申请日:2011-10-17

    IPC分类号: G06F17/00 G06F17/30

    CPC分类号: G06F17/30743 G06F17/3074

    摘要: Systems and methods for audio matching are disclosed herein. In one embodiment, a system includes both interest point mixing and fingerprint mixing by using multiple interest point detection methods in parallel. Since multiple interest point detection methods are used in parallel, accuracy of audio matching is improved across a wide variety of audio signals. In addition the scalability of the disclosed audio matching system is increased by matching the fingerprint of an audio sample with a fingerprint of a reference sample versus matching an entire spectrogram. Accordingly, a more accurate and more general solution to audio matching can be accomplished.

    摘要翻译: 本文公开了用于音频匹配的系统和方法。 在一个实施例中,系统通过并行使用多个兴趣点检测方法来包括兴趣点混合和指纹混合。 由于并行地使用多个兴趣点检测方法,因此在多种音频信号中提高了音频匹配的精度。 此外,通过将音频样本的指纹与参考样本的指纹匹配以匹配整个频谱图来增加所公开的音频匹配系统的可扩展性。 因此,可以实现更准确和更一般的音频匹配解决方案。

    Detection of inactive broadcasts during live stream ingestion
    9.
    发明授权
    Detection of inactive broadcasts during live stream ingestion 有权
    在实况流摄入过程中检测无效广播

    公开(公告)号:US08938089B1

    公开(公告)日:2015-01-20

    申请号:US13533818

    申请日:2012-06-26

    IPC分类号: G06K9/00

    摘要: Systems and methods are provided herein relating to real-time detection of inactive broadcasts during live stream ingestion. Both audio fingerprints and video fingerprints can be dynamically and continuously generated for a live stream ingestion. Sets of video fingerprints and sets of audio fingerprints can be continuously generated based on common successive overlapping time windows. A set of audio fingerprints and a set of video fingerprints can be associated with each time window. Video similarity scores and audio similarity scores can be generates for each time window to determine whether the stream is inactive or static during the time window. Only fingerprints relating to an active broadcast can be indexed in a fingerprint index.

    摘要翻译: 本文提供的系统和方法涉及在实时流摄取期间实时检测无效广播。 可以动态连续地生成音频指纹和视频指纹,用于实况流摄取。 可以基于共同的连续重叠时间窗口连续地生成视频指纹集和音频指纹集。 一组音频指纹和一组视频指纹可以与每个时间窗口相关联。 可以为每个时间窗口生成视频相似度分数和音频相似性分数,以在时间窗口中确定流是不活动还是静态。 只有与主动广播相关的指纹才能在指纹索引中索引。

    Intelligent interest point pruning for audio matching
    10.
    发明授权
    Intelligent interest point pruning for audio matching 有权
    智能兴趣点修剪音频匹配

    公开(公告)号:US08831763B1

    公开(公告)日:2014-09-09

    申请号:US13276316

    申请日:2011-10-18

    IPC分类号: G10L15/20 G10L15/02

    CPC分类号: G10L25/54 G10L25/87

    摘要: System and methods for intelligently pruning interest points are disclosed herein. The systems include generating a plurality of distorted audio samples and associated distorted interest points based upon a clean audio sample. Interest points that are common to sets of distorted interest points are retained with interest points not robust to distortion discarded. The disclosed systems and methods therefore can provide for a scalable audio matching solution by eliminating interest points in reference sample fingerprints. The set of pruned interest points are robust to distortion and the benefits of both scalability and accuracy can be had.

    摘要翻译: 本文公开了用于智能修剪兴趣点的系统和方法。 系统包括基于干净的音频样本产生多个失真的音频样本和相关联的失真的兴趣点。 对于一组扭曲的兴趣点常见的兴趣点被保留,对于丢弃的失真不利于兴趣点。 因此,所公开的系统和方法可以通过消除参考样本指纹中的兴趣点来提供可扩展的音频匹配解决方案。 修剪的兴趣点的集合对于失真是稳健的,并且可以实现可扩展性和准确性的优点。