IDENTIFYING MEDIA CONTENT
    11.
    发明申请

    公开(公告)号:US20140114659A1

    公开(公告)日:2014-04-24

    申请号:US14142042

    申请日:2013-12-27

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.

    Detection of inactive broadcasts during live stream ingestion
    14.
    发明授权
    Detection of inactive broadcasts during live stream ingestion 有权
    在实况流摄入过程中检测无效广播

    公开(公告)号:US09536151B1

    公开(公告)日:2017-01-03

    申请号:US14581789

    申请日:2014-12-23

    Applicant: Google Inc.

    CPC classification number: G06K9/00744 G06K9/00751 G10L19/018

    Abstract: Systems and methods are provided herein relating to real-time detection of inactive broadcasts during live stream ingestion. Both audio fingerprints and video fingerprints can be dynamically and continuously generated for a live stream ingestion. Sets of video fingerprints and sets of audio fingerprints can be continuously generated based on common successive overlapping time windows. A set of audio fingerprints and a set of video fingerprints can be associated with each time window. Video similarity scores and audio similarity scores can be generates for each time window to determine whether the stream is inactive or static during the time window. Only fingerprints relating to an active broadcast can be indexed in a fingerprint index.

    Abstract translation: 本文提供的系统和方法涉及在实时流摄取期间实时检测无效广播。 可以动态连续地生成音频指纹和视频指纹,用于实况流摄取。 可以基于共同的连续重叠时间窗口连续地生成视频指纹集和音频指纹集。 一组音频指纹和一组视频指纹可以与每个时间窗口相关联。 可以为每个时间窗口生成视频相似度分数和音频相似性分数,以在时间窗口中确定流是不活动还是静态。 只有与主动广播相关的指纹才能在指纹索引中索引。

    System and method for adding pitch shift resistance to an audio fingerprint
    15.
    发明授权
    System and method for adding pitch shift resistance to an audio fingerprint 有权
    为音频指纹添加音高变化阻力的系统和方法

    公开(公告)号:US09159327B1

    公开(公告)日:2015-10-13

    申请号:US13723034

    申请日:2012-12-20

    Applicant: Google Inc.

    Abstract: Systems and techniques for adding pitch shift resistance to an audio fingerprint are presented. In particular, an audio track for a media file is received. A first audio fingerprint for the audio track with a first pitch shift and an Nth audio fingerprint for the audio track with an Mth pitch shift are generated, where N is an integer greater than or equal to two and M is an integer greater than or equal to two. A combined audio fingerprint is generated from at least the first audio fingerprint and the Nth audio fingerprint.

    Abstract translation: 介绍了增加音高转换电阻到音频指纹的系统和技术。 特别地,接收用于媒体文件的音轨。 产生具有第一间距移位的音轨的第一音频指纹和具有第M音调移位的音轨的第N个音频指纹,其中N是大于或等于2的整数,M是大于或等于的整数 到两个。 从至少第一音频指纹和第N音频指纹生成组合音频指纹。

    PROVIDING DEVICE-SPECIFIC INSTRUCTIONS IN RESPONSE TO A PERCEPTION OF A MEDIA CONTENT SEGMENT
    16.
    发明申请
    PROVIDING DEVICE-SPECIFIC INSTRUCTIONS IN RESPONSE TO A PERCEPTION OF A MEDIA CONTENT SEGMENT 审中-公开
    针对媒体内容部分的意见提供设备特定说明

    公开(公告)号:US20150019612A1

    公开(公告)日:2015-01-15

    申请号:US13938216

    申请日:2013-07-09

    Applicant: Google Inc.

    CPC classification number: H04L67/02 H04L67/20 H04L67/303

    Abstract: Systems and methods are disclosed for providing device-specific instructions in response to a perception of a media content segment. In one implementation, a processing device captures, at a user device, one or more media content segments. The processing device provides the one or more media content segments to a remote device. The processing device receives one or more instructions, each of the one or more instructions being associated with at least one of the one or more media content segments and corresponding to one or more operations. The processing device initiates execution of at least one of the one or more instructions.

    Abstract translation: 公开了用于响应于媒体内容段的感知来提供设备特定指令的系统和方法。 在一个实现中,处理设备在用户设备处捕获一个或多个媒体内容段。 处理设备将一个或多个媒体内容段提供给远程设备。 所述处理设备接收一个或多个指令,所述一个或多个指令中的每一个与所述一个或多个媒体内容段中的至少一个相关联并对应于一个或多个操作。 处理装置启动执行一个或多个指令中的至少一个指令。

    Identifying media content
    17.
    发明授权

    公开(公告)号:US08484017B1

    公开(公告)日:2013-07-09

    申请号:US13626351

    申请日:2012-09-25

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.

    Differentiating between near identical versions of a song
    18.
    发明授权
    Differentiating between near identical versions of a song 有权
    区分一首歌曲的近似相同版本

    公开(公告)号:US09153239B1

    公开(公告)日:2015-10-06

    申请号:US13803686

    申请日:2013-03-14

    Applicant: Google Inc.

    CPC classification number: G10L25/51 G10L25/18

    Abstract: Identifying near identical versions of a probe sample from reference files comprises identifying discriminative regions of reference matches by generating a similarity matrix. The discriminative time frames are communicated to a client device and additional data associated with the probe sample can be retrieved having features of the discriminative regions. Based on the additional data, a single match can be generated to identify the probe sample.

    Abstract translation: 识别来自参考文件的探针样本的近似相同版本包括通过生成相似性矩阵来识别参考匹配的识别区域。 鉴别时间帧被传送到客户端设备,并且可以检索与探测器样本相关联的附加数据,其具有区别区域的特征。 基于附加数据,可以生成单个匹配以识别探针样本。

    Methods for enforcing time alignment for speed resistant audio matching
    19.
    发明授权
    Methods for enforcing time alignment for speed resistant audio matching 有权
    强制音速匹配的时间对齐方法

    公开(公告)号:US09069849B1

    公开(公告)日:2015-06-30

    申请号:US13648472

    申请日:2012-10-10

    Applicant: Google Inc.

    CPC classification number: G06F17/30743 G10L21/04 G10L25/51

    Abstract: Systems and methods are provided herein relating to speed resistant audio matching. Descriptors can be generated for a received audio signal and matched with reference descriptors. A set of hits for respective reference samples can be generated based on the matching. A histogram can then be generated that correlates probe sample hit time with reference sample hit time. In one implementation, a rolling window can be used in analyzing the histogram allowing for slight variances in the timing between probe sample hits and reference sample hits. In another implementation, the histogram generated can be based on an estimated time stretch of the probe sample. In yet another implementation, a set of histograms can be generated based on a minimum speed change, a maximum speed change, and a speed step. Histograms can be evaluated to determine a most likely matching histogram.

    Abstract translation: 本文提供了与耐速度音频匹配相关的系统和方法。 可以为接收到的音频信号生成描述符,并与参考描述符匹配。 可以基于匹配来生成针对各个参考样本的一组命中。 然后可以生成将探针样品命中时间与参考样品命中时间相关联的直方图。 在一个实施方式中,可以使用滚动窗口来分析直方图,从而可以在探针样品命中和参考样品命中之间的时间上有轻微的变化。 在另一实施方案中,生成的直方图可以基于探测样本的估计时间延长。 在又一实现中,可以基于最小速度变化,最大速度变化和速度步长来生成一组直方图。 可以对直方图进行评估,以确定最可能的匹配直方图。

    PROVIDING DEVICE-SPECIFIC INSTRUCTIONS IN RESPONSE TO A PERCEPTION OF A MEDIA CONTENT SEGMENT
    20.
    发明申请
    PROVIDING DEVICE-SPECIFIC INSTRUCTIONS IN RESPONSE TO A PERCEPTION OF A MEDIA CONTENT SEGMENT 审中-公开
    针对媒体内容部分的意见提供设备特定说明

    公开(公告)号:US20150019611A1

    公开(公告)日:2015-01-15

    申请号:US13938197

    申请日:2013-07-09

    Applicant: Google Inc.

    CPC classification number: H04L67/02 H04L67/20 H04L67/303

    Abstract: Systems and methods are disclosed for providing device-specific instructions in response to a perception of a media content segment. In one implementation, a processing device receives one or more media content segments from a user device. The processing device processes the one or more media content segments to determine one or more operations associated with the one or more media content segments. The processing device selects, based on one or more characteristics associated with the user device, at least one of the one or more operations. The processing device provides one or more instructions to perform the at least one of the one or more operations in relation to the user device.

    Abstract translation: 公开了用于响应于媒体内容段的感知来提供设备特定指令的系统和方法。 在一个实现中,处理设备从用户设备接收一个或多个媒体内容片段。 所述处理设备处理所述一个或多个媒体内容片段以确定与所述一个或多个媒体内容片段相关联的一个或多个操作。 处理装置基于与用户装置相关联的一个或多个特征来选择一个或多个操作中的至少一个。 处理设备提供一个或多个指令以执行与用户设备相关的一个或多个操作中的至少一个操作。

Patent Agency Ranking