QUERY RESPONSE USING MEDIA CONSUMPTION HISTORY
    71.
    发明申请
    QUERY RESPONSE USING MEDIA CONSUMPTION HISTORY 审中-公开
    使用媒体消费历史查询响应

    公开(公告)号:US20170004132A1

    公开(公告)日:2017-01-05

    申请号:US15267463

    申请日:2016-09-16

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus for receiving a natural language query of a user, and environmental data, identifying a media item based on the environmental data, determining an entity type based on the natural language query, selecting an entity associated with the media item that matches the entity type, selecting, from a media consumption database that identifies media items that have been indicated as consumed by the user, one or more media items that have been indicated as consumed by the user and that are associated with the selected entity, and providing a response to the query based on selecting the one or more media items that have been indicated as consumed by the user and that are associated with the selected entity.

    Abstract translation: 用于接收用户的自然语言查询的方法,系统和装置,以及环境数据,基于环境数据识别媒体项目,基于自然语言查询确定实体类型,选择与媒体项目相关联的实体, 匹配实体类型,从媒体消费数据库中选择,该媒体消费数据库标识已被指示为用户消费的媒体项目,已被指示为由用户消费并且与所选择的实体相关联的一个或多个媒体项目,以及 基于选择已被指示为由用户消费并且与所选择的实体相关联的一个或多个媒体项来向所述查询提供响应。

    Speaker identification using hash-based indexing
    73.
    发明授权
    Speaker identification using hash-based indexing 有权
    扬声器识别使用基于散列的索引

    公开(公告)号:US09514753B2

    公开(公告)日:2016-12-06

    申请号:US14523198

    申请日:2014-10-24

    Applicant: Google Inc.

    CPC classification number: G10L17/02 G10L17/005 G10L17/08 G10L17/18 G10L25/51

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing speaker identification. In some implementations, an utterance vector that is derived from an utterance is obtained. Hash values are determined for the utterance vector according to multiple different hash functions. A set of speaker vectors from a plurality of hash tables is determined using the hash values, where each speaker vector was derived from one or more utterances of a respective speaker. The speaker vectors in the set are compared with the utterance vector. A speaker vector is selected based on comparing the speaker vectors in the set with the utterance vector.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的用于执行说话人识别的计算机程序。 在一些实现中,获得从话语导出的话语向量。 根据多个不同的哈希函数为发声向量确定哈希值。 使用散列值来确定来自多个散列表的一组扬声器向量,其中每个扬声器向量是从相应说话者的一个或多个话语导出的。 将集合中的扬声器矢量与发声矢量进行比较。 基于将集合中的扬声器矢量与发声矢量进行比较来选择扬声器矢量。

    Initiating actions based on partial hotwords
    75.
    发明授权
    Initiating actions based on partial hotwords 有权
    基于部分热门词汇启动操作

    公开(公告)号:US09502026B2

    公开(公告)日:2016-11-22

    申请号:US14991092

    申请日:2016-01-08

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, receiving audio data; determining that an initial portion of the audio data corresponds to an initial portion of a hotword; in response to determining that the initial portion of the audio data corresponds to the initial portion of the hotword, selecting, from among a set of one or more actions that are performed when the entire hotword is detected, a subset of the one or more actions; and causing one or more actions of the subset to be performed.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,接收音频数据; 确定音频数据的初始部分对应于热门词的初始部分; 响应于确定音频数据的初始部分对应于热门词的初始部分,从在检测到整个热词时执行的一个或多个动作的集合中选择一个或多个动作的子集 ; 并且引起所述子集的一个或多个动作被执行。

    Audio Data Classification
    76.
    发明申请
    Audio Data Classification 审中-公开
    音频数据分类

    公开(公告)号:US20160322066A1

    公开(公告)日:2016-11-03

    申请号:US13932198

    申请日:2013-07-01

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for analyzing an audio sample to determine whether the audio sample includes music audio data. One or more detectors, including a spectral fluctuation detector, a peak repetition detector, and a beat pitch detector, may analyze the audio sample and generate a score that represents whether the audio sample includes music audio data. One or more of the scores may be combined to determine whether the audio sample includes music audio data or non-music audio data.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于分析音频样本以确定音频样本是否包括音乐音频数据。 一个或多个检测器,包括光谱波动检测器,峰值重复检测器和拍频间隔检测器,可以分析音频样本并产生表示音频样本是否包括音乐音频数据的得分。 可以组合一个或多个分数以确定音频样本是否包括音乐音频数据或非音乐音频数据。

    Positioning using audio recognition
    78.
    发明授权
    Positioning using audio recognition 有权
    使用音频识别定位

    公开(公告)号:US09435878B1

    公开(公告)日:2016-09-06

    申请号:US14488858

    申请日:2014-09-17

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    CPC classification number: G01S5/24 G06F17/30743 G10L25/48

    Abstract: Systems and methods for determining location based on audio fingerprinting are disclosed. An extraction component extracts a set of interest points from an audio signal associated with an audio announcement. Then a matching component determines if the extracted set of interest points matches a set of interest points representative of an audio fingerprint in a data store comprising audio fingerprints. In an aspect, the audio fingerprints in the audio fingerprint data store represent announcements for underground transportation systems. A location component further determines location information associated with the audio fingerprint based in part on the set of extracted interest points matching the set of audio interest points representative of the audio fingerprint in the data store.

    Abstract translation: 公开了基于音频指纹识别位置的系统和方法。 提取组件从与音频通知相关联的音频信号中提取一组感兴趣点。 然后,匹配组件确定所提取的感兴趣组是否与包括音频指纹的数据存储器中的表示音频指纹的一组感兴趣点匹配。 在一方面,音频指纹数据存储器中的音频指纹代表地下运输系统的公告。 位置组件还部分地基于与代表数据存储器中的音频指纹的音频兴趣点集合匹配的提取的兴趣点集合来确定与音频指纹相关联的位置信息。

    DRAG-AND-DROP ON A MOBILE DEVICE
    79.
    发明申请
    DRAG-AND-DROP ON A MOBILE DEVICE 有权
    DRAG-AND DROP在移动设备上

    公开(公告)号:US20160117072A1

    公开(公告)日:2016-04-28

    申请号:US14522927

    申请日:2014-10-24

    Applicant: GOOGLE INC.

    Abstract: Implementations provide an improved drag-and-drop operation on a mobile device. For example, a method includes identifying a drag area in a user interface of a first mobile application in response to a drag command, identifying an entity from a data store based on recognition performed on content in the drag area, receiving a drop location associated with a second mobile application, determining an action to perform in the second mobile application based on the drop location, and performing the action in the second mobile action using the entity. Another method may include receiving a selection of a smart copy control for a text input control in a first mobile application, receiving a selected area of a display generated by a second mobile application, identifying an entity in the selected area, automatically navigating back to the text input control, and pasting a description of the entity in the text input control.

    Abstract translation: 实现方式可以在移动设备上提供改进的拖放操作。 例如,一种方法包括响应于拖曳命令识别第一移动应用的用户界面中的拖曳区域,基于对拖曳区域中的内容执行的识别从数据存储区识别实体,接收与 第二移动应用,基于所述丢弃位置确定在所述第二移动应用中执行的动作,以及使用所述实体在所述第二移动动作中执行所述动作。 另一种方法可以包括接收对第一移动应用中的文本输入控制的智能复制控制的选择,接收由第二移动应用生成的显示的选定区域,识别所选区域中的实体,自动导航回到 文本输入控件,并在文本输入控件中粘贴实体的描述。

    HOTWORD DETECTION ON MULTIPLE DEVICES

    公开(公告)号:US20160104480A1

    公开(公告)日:2016-04-14

    申请号:US14675932

    申请日:2015-04-01

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.

Patent Agency Ranking