Interface for real-time audio recognition
    11.
    Granted invention patent (in force)

    Publication number: US09280599B1

    Publication date: 2016-03-08

    Application number: US13405023

    Filing date: 2012-02-24

    Abstract: An audio recognition service recognizes an audio sample across multiple content types. At least a partial set of results generated by the service is returned to a client while the audio sample is still being recorded and/or transmitted. The client additionally displays the results in real-time or near real-time to the user. The audio sample can be sent over a first HTTP connection and the results can be returned over a second HTTP connection. The audio recognition service further processes check-in selections received from the client for content items indicated by the results. Responsive to receiving the check-in selections, the service determines whether a user is eligible for a reward. If the user is eligible, the service provides the reward.

    Aggregation of related media content
    12.
    Granted invention patent (in force)

    Publication number: US09159364B1

    Publication date: 2015-10-13

    Application number: US13361778

    Filing date: 2012-01-30

    CPC classification number: G11B27/034 G11B27/10 G11B27/28 H04N5/04

    Abstract: Systems and methods for media aggregation are disclosed herein. The system includes a media system that can transform media items into one aggregated media item. A synchronization component synchronizes media items with respect to time. The synchronized media items can be analyzed and transformed into an aggregated media item for storage and/or display. In one implementation, the aggregated media item is capable of being displayed in multiple ways to create an enhanced and customizable viewing and/or listening experience.

    Ensemble interest point detection for audio matching
    13.
    Granted invention patent (in force)

    Publication number: US09098576B1

    Publication date: 2015-08-04

    Application number: US13274725

    Filing date: 2011-10-17

    CPC classification number: G06F17/30743 G06F17/3074

    Abstract: Systems and methods for audio matching are disclosed herein. In one embodiment, a system includes both interest point mixing and fingerprint mixing by using multiple interest point detection methods in parallel. Since multiple interest point detection methods are used in parallel, accuracy of audio matching is improved across a wide variety of audio signals. In addition, the scalability of the disclosed audio matching system is increased by matching the fingerprint of an audio sample with a fingerprint of a reference sample rather than matching an entire spectrogram. Accordingly, a more accurate and more general solution to audio matching can be accomplished.
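
    The interest point mixing described in this abstract can be illustrated with a minimal sketch. The abstract names no specific detectors, so the two detectors below (`peaks_over_time`, `peaks_over_frequency`), the list-of-lists spectrogram layout, and the union-based mixing are all assumptions chosen for illustration. The detectors are independent of one another and could run concurrently; here they simply run one after another.

```python
from typing import Callable, Iterable, List, Set, Tuple

Spectrogram = List[List[float]]  # spectrogram[t][f] = magnitude (assumed layout)
Point = Tuple[int, int]          # (time_frame, frequency_bin)
Detector = Callable[[Spectrogram], Set[Point]]


def peaks_over_time(spec: Spectrogram) -> Set[Point]:
    """Hypothetical detector: bins louder than the same bin one frame earlier and later."""
    return {
        (t, f)
        for t in range(1, len(spec) - 1)
        for f in range(len(spec[t]))
        if spec[t][f] > spec[t - 1][f] and spec[t][f] > spec[t + 1][f]
    }


def peaks_over_frequency(spec: Spectrogram) -> Set[Point]:
    """Hypothetical detector: bins louder than both neighbouring bins in the same frame."""
    return {
        (t, f)
        for t in range(len(spec))
        for f in range(1, len(spec[t]) - 1)
        if spec[t][f] > spec[t][f - 1] and spec[t][f] > spec[t][f + 1]
    }


def ensemble_interest_points(
    spec: Spectrogram, detectors: Iterable[Detector]
) -> Set[Point]:
    """Mix the outputs of several independent detectors by taking their union."""
    points: Set[Point] = set()
    for detect in detectors:
        points |= detect(spec)
    return points


spec = [[0, 1, 0], [5, 2, 0], [0, 1, 4]]
mixed = ensemble_interest_points(spec, [peaks_over_time, peaks_over_frequency])
```

    Taking the union is only one possible mixing rule; a weighted vote or a per-detector fingerprint would fit the same interface.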

    Detection of inactive broadcasts during live stream ingestion
    14.
    Granted invention patent (in force)

    Publication number: US08938089B1

    Publication date: 2015-01-20

    Application number: US13533818

    Filing date: 2012-06-26

    CPC classification number: G06K9/00744 G06K9/00751 G10L19/018

    Abstract: Systems and methods are provided herein relating to real-time detection of inactive broadcasts during live stream ingestion. Both audio fingerprints and video fingerprints can be dynamically and continuously generated for a live stream ingestion. Sets of video fingerprints and sets of audio fingerprints can be continuously generated based on common successive overlapping time windows. A set of audio fingerprints and a set of video fingerprints can be associated with each time window. Video similarity scores and audio similarity scores can be generated for each time window to determine whether the stream is inactive or static during the time window. Only fingerprints relating to an active broadcast can be indexed in a fingerprint index.
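
    The windowing and scoring described in this abstract might look roughly like the sketch below. The abstract does not say how fingerprints are represented or how similarity is computed, so several things here are assumptions: fingerprints are 16-bit integers, similarity is the fraction of matching bits, a window's score is the mean similarity of consecutive fingerprints, and a window counts as static only when both the audio and the video score reach `threshold`. The window `size` must be at least 2.

```python
from typing import List, Sequence, Tuple

BITS = 16  # assumed fingerprint width


def windows(n_items: int, size: int, hop: int) -> List[Tuple[int, int]]:
    """Successive (start, end) index windows; they overlap whenever hop < size."""
    return [(s, s + size) for s in range(0, n_items - size + 1, hop)]


def similarity(a: int, b: int) -> float:
    """Fraction of matching bits between two fingerprints."""
    differing = bin((a ^ b) & ((1 << BITS) - 1)).count("1")
    return 1.0 - differing / BITS


def window_score(fingerprints: Sequence[int]) -> float:
    """Mean similarity of consecutive fingerprints inside one window."""
    pairs = list(zip(fingerprints, fingerprints[1:]))
    return sum(similarity(a, b) for a, b in pairs) / len(pairs)


def active_windows(
    audio: Sequence[int],
    video: Sequence[int],
    size: int = 4,
    hop: int = 2,
    threshold: float = 0.95,
) -> List[Tuple[int, int]]:
    """Return the windows that are not static in both audio and video."""
    n = min(len(audio), len(video))
    active = []
    for start, end in windows(n, size, hop):
        static_audio = window_score(audio[start:end]) >= threshold
        static_video = window_score(video[start:end]) >= threshold
        if not (static_audio and static_video):
            active.append((start, end))
    return active


# Four identical fingerprints (a frozen stream) followed by changing content.
audio = [0xAAAA] * 4 + [0x1234, 0xFFFF, 0x0F0F, 0x8001]
video = list(audio)
to_index = active_windows(audio, video)
```

    Only the fingerprints that fall in the returned windows would be passed on to the fingerprint index; the fully static first window is skipped.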

    Intelligent interest point pruning for audio matching
    15.
    Granted invention patent (in force)

    Publication number: US08831763B1

    Publication date: 2014-09-09

    Application number: US13276316

    Filing date: 2011-10-18

    CPC classification number: G10L25/54 G10L25/87

    Abstract: Systems and methods for intelligently pruning interest points are disclosed herein. The systems include generating a plurality of distorted audio samples and associated distorted interest points based upon a clean audio sample. Interest points that are common to sets of distorted interest points are retained, while interest points not robust to distortion are discarded. The disclosed systems and methods therefore can provide for a scalable audio matching solution by eliminating interest points in reference sample fingerprints. The set of pruned interest points is robust to distortion, and the benefits of both scalability and accuracy can be had.
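
    The retention rule in this abstract can be sketched as a set operation. The sketch assumes that interest points are `(time_frame, frequency_bin)` tuples and that the distorted samples have already been run through some detector; the function name and the `min_fraction` parameter are hypothetical and do not come from the patent.

```python
from collections import Counter
from typing import Iterable, Set, Tuple

Point = Tuple[int, int]  # (time_frame, frequency_bin), an assumed representation


def prune_interest_points(
    clean: Set[Point],
    distorted_sets: Iterable[Set[Point]],
    min_fraction: float = 1.0,
) -> Set[Point]:
    """Keep clean-sample points that survive in at least min_fraction of the distorted sets."""
    distorted = [set(s) for s in distorted_sets]
    counts = Counter(p for s in distorted for p in s)
    needed = min_fraction * len(distorted)
    return {p for p in clean if counts[p] >= needed}


clean = {(0, 100), (1, 200), (2, 300)}
distorted = [
    {(0, 100), (1, 200)},           # e.g. after added noise
    {(0, 100), (2, 300)},           # e.g. after lossy compression
    {(0, 100), (1, 200), (5, 50)},  # e.g. after equalization
]
strict = prune_interest_points(clean, distorted)
relaxed = prune_interest_points(clean, distorted, min_fraction=0.6)
```

    With `min_fraction=1.0` only points present in every distorted set survive; lowering it trades fingerprint size for recall.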

    System and method for synchronizing tag history
    16.
    Granted invention patent (in force)

    Publication number: US08735708B1

    Publication date: 2014-05-27

    Application number: US13770661

    Filing date: 2013-02-19

    CPC classification number: G10H1/0033 G10H2240/135 G10H2240/211 G10H2240/251

    Abstract: Systems and methods for music recognition and/or tag history synchronization are described. The system includes, for example, a first device, a second device and a server. The first device is configured to record music from a surrounding environment. The first device wirelessly sends the recorded music to the server for identification. The server is configured to identify the recorded music and to generate a tag corresponding to the identified music. A first tag history is updated to include the tag, which includes information corresponding to the identified music. The first device and the second device are registered with the server as part of a particular user account. The server is configured to synchronize a second tag history stored in the second device with the updated first tag history.
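
    The synchronization step in this abstract can be pictured with a small in-memory model. Everything named below (`Tag`, `TagServer`, the field names, the merge-and-sort policy) is a hypothetical illustration; the music identification step is left out, and the server simply receives a ready-made tag.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Tag:
    track_id: str
    title: str
    tagged_at: int  # seconds since epoch


class TagServer:
    """Toy server that keeps one tag history per registered device."""

    def __init__(self) -> None:
        # account -> device id -> tag history for that device
        self._histories: Dict[str, Dict[str, List[Tag]]] = {}

    def register(self, account: str, device_id: str) -> None:
        self._histories.setdefault(account, {}).setdefault(device_id, [])

    def add_tag(self, account: str, device_id: str, tag: Tag) -> None:
        """Update the tagging device's history, then synchronize the account."""
        self._histories[account][device_id].append(tag)
        self._synchronize(account)

    def _synchronize(self, account: str) -> None:
        """Give every device on the account the merged, time-ordered history."""
        devices = self._histories[account]
        merged = sorted(
            {tag for history in devices.values() for tag in history},
            key=lambda tag: (tag.tagged_at, tag.track_id),
        )
        for device_id in devices:
            devices[device_id] = list(merged)

    def history(self, account: str, device_id: str) -> List[Tag]:
        return list(self._histories[account][device_id])


server = TagServer()
server.register("alice", "phone")
server.register("alice", "tablet")
first = Tag("t1", "Song A", 100)
server.add_tag("alice", "phone", first)
```

    After the call to `add_tag`, the tablet's history contains the tag created on the phone, which mirrors the second tag history being synchronized with the updated first one.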

    Systems and methods for facilitating higher confidence matching by a computer-based melody matching system
    17.
    Granted invention patent (in force)

    Publication number: US08212135B1

    Publication date: 2012-07-03

    Application number: US13276566

    Filing date: 2011-10-19

    CPC classification number: G10H1/00 G10H2240/075 G10H2240/141

    Abstract: Systems and methods for facilitating higher confidence matches are provided. In one embodiment, a system includes a memory that stores computer executable components, and a microprocessor that executes the computer executable components stored in the memory. The components can include a metadata matching component that determines a metadata match level between metadata of a plurality of files, and a thresholding component. The thresholding component may compare a metadata threshold with the metadata match level and output a signal configured to cause a decrease in a melody matching strength threshold from a first value to a second value based at least on the metadata match level being greater than the metadata threshold.
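
    The thresholding logic in this abstract reduces to a small conditional, sketched below. The way the metadata match level is computed (the fraction of shared fields that agree), the function names, and all numeric defaults are assumptions made for illustration; the abstract only states that the melody matching strength threshold drops from a first value to a second value when the metadata match level exceeds the metadata threshold.

```python
from typing import Dict


def metadata_match_level(a: Dict[str, str], b: Dict[str, str]) -> float:
    """Assumed metric: fraction of shared metadata fields whose values agree."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    same = sum(1 for k in shared if a[k].strip().lower() == b[k].strip().lower())
    return same / len(shared)


def melody_strength_threshold(
    match_level: float,
    metadata_threshold: float = 0.6,
    first_value: float = 0.9,
    second_value: float = 0.7,
) -> float:
    """Lower the melody threshold only when metadata agreement is strong."""
    if match_level > metadata_threshold:
        return second_value
    return first_value


def is_melody_match(strength: float, a: Dict[str, str], b: Dict[str, str]) -> bool:
    """Accept a melody match if its strength clears the metadata-adjusted threshold."""
    return strength >= melody_strength_threshold(metadata_match_level(a, b))


original = {"title": "Yesterday", "artist": "The Beatles"}
cover = {"title": "yesterday ", "artist": "the beatles"}
unrelated = {"title": "Help!", "artist": "Someone Else"}
```

    A melody strength of 0.75 is accepted for `original` versus `cover` because the matching metadata lowers the bar to 0.7, while the same strength is rejected against `unrelated`, where the bar stays at 0.9.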

    Audio matching using time-frequency onsets
    18.
    Granted invention patent (in force)

    Publication number: US09471673B1

    Publication date: 2016-10-18

    Application number: US13418334

    Filing date: 2012-03-12

    Abstract: Systems and methods are provided herein relating to audio matching. Interest points that are onsets are generally very efficient in audio matching in that they are robust to multiple types of distortion. Prominent onsets can be detected within an audio signal excerpt as interest points and combined as a function of a set of interest points to form a descriptor. Descriptors associated with an audio signal excerpt that contain a set of prominent onsets as interest points can be used in matching the audio signal excerpt to an audio reference. The benefits in generating and using prominent onsets within descriptors improve the accuracy of an audio matching system.

    Full digest of an audio file for identifying duplicates
    19.
    Granted invention patent (in force)

    Publication number: US08953811B1

    Publication date: 2015-02-10

    Application number: US13450427

    Filing date: 2012-04-18

    CPC classification number: H04H60/58 H04H60/37 H04H2201/90

    Abstract: Systems and methods are provided herein relating to audio matching. A compact digest can be generated based on sets of triples, where triples are groupings of three interest points that meet threshold criteria. The compact digest can be used in identifying a potential audio match. A full digest can then be used in verifying the potential match. By using a compact digest to perform audio matching, the audio matching system can be scaled to encompass millions or billions of reference audio samples while still using the full digest to maintain accuracy.
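
    The two-stage idea in this abstract (a cheap compact digest to find candidates, then a full digest to verify them) can be sketched as follows. The abstract does not define the threshold criteria for triples or the contents of either digest, so the choices below are assumptions: a triple is any three interest points spanning at most `max_span` frames, the compact digest is the set of time-shift-invariant triple keys, the full digest is the set of all interest points with times taken relative to the first point, and both stages compare digests by Jaccard overlap.

```python
from itertools import combinations
from typing import Dict, FrozenSet, Iterable, List, Tuple

Point = Tuple[int, int]  # (time_frame, frequency_bin), an assumed representation


def compact_digest(points: Iterable[Point], max_span: int = 10) -> FrozenSet[tuple]:
    """Keys built from groups of three interest points that lie close together in time."""
    keys = set()
    for a, b, c in combinations(sorted(points), 3):
        if c[0] - a[0] <= max_span:
            keys.add((a[1], b[1], c[1], b[0] - a[0], c[0] - b[0]))
    return frozenset(keys)


def full_digest(points: Iterable[Point]) -> FrozenSet[Point]:
    """All interest points, with times measured from the earliest point."""
    pts = sorted(points)
    if not pts:
        return frozenset()
    t0 = pts[0][0]
    return frozenset((t - t0, f) for t, f in pts)


def jaccard(a: frozenset, b: frozenset) -> float:
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_duplicates(
    query: List[Point],
    references: Dict[str, List[Point]],
    compact_min: float = 0.5,
    full_min: float = 0.9,
) -> List[str]:
    """Stage 1 filters on the compact digest; stage 2 verifies with the full digest."""
    q_compact, q_full = compact_digest(query), full_digest(query)
    hits = []
    for name, ref in references.items():
        if jaccard(q_compact, compact_digest(ref)) < compact_min:
            continue  # not even a potential match
        if jaccard(q_full, full_digest(ref)) >= full_min:
            hits.append(name)
    return hits


song = [(0, 10), (2, 20), (5, 30), (9, 40)]
shifted_copy = [(t + 100, f) for t, f in song]
other = [(0, 11), (3, 25), (4, 35), (8, 45)]
matches = find_duplicates(shifted_copy, {"song": song, "other": other})
```

    The linear scan over `references` keeps the sketch short; at the scale the abstract mentions, the compact digests would sit in an index so that stage 1 is a lookup rather than a loop.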

    Noise based interest point density pruning
    20.
    Granted invention patent (in force)

    Publication number: US08805560B1

    Publication date: 2014-08-12

    Application number: US13276318

    Filing date: 2011-10-18

    CPC classification number: G06F17/30743 G06F17/30758 G10L25/54

    Abstract: Systems and methods for noise based interest point density pruning are disclosed herein. The systems include determining an amount of noise in an audio sample and adjusting the number of interest points within an audio sample fingerprint based on the amount of noise. Samples containing high amounts of noise correspondingly generate fingerprints with more interest points. The disclosed systems and methods allow reference fingerprints to be reduced in size while increasing the size of sample fingerprints. The benefits in scalability do not compromise the accuracy of an audio matching system using noise based interest point density pruning.
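
    The density adjustment in this abstract can be sketched as a noise-dependent budget of interest points. The abstract does not say how noise is measured or how the budget is derived, so this sketch assumes a noise level already normalized to the range 0 to 1, a linear mapping from noise to point count, and interest points given as `(time_frame, frequency_bin, strength)` tuples. All names and defaults are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[int, int, float]  # (time_frame, frequency_bin, strength), assumed


def target_count(noise_level: float, min_points: int = 10, max_points: int = 50) -> int:
    """Map a noise level in [0, 1] linearly onto an interest point budget."""
    noise = min(max(noise_level, 0.0), 1.0)
    return round(min_points + noise * (max_points - min_points))


def prune_by_noise(
    points: List[Point],
    noise_level: float,
    min_points: int = 10,
    max_points: int = 50,
) -> List[Point]:
    """Keep the strongest points up to the noise-dependent budget, in time order."""
    budget = target_count(noise_level, min_points, max_points)
    strongest = sorted(points, key=lambda p: p[2], reverse=True)[:budget]
    return sorted(strongest)


points = [(0, 5, 0.9), (1, 6, 0.1), (2, 7, 0.5), (3, 8, 0.7)]
quiet = prune_by_noise(points, noise_level=0.0, min_points=2, max_points=4)
noisy = prune_by_noise(points, noise_level=1.0, min_points=2, max_points=4)
```

    A quiet sample keeps only its two strongest points, while a noisy sample keeps all four, matching the abstract's statement that noisier samples generate fingerprints with more interest points.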
