-
Publication No.: US20150127342A1
Publication date: 2015-05-07
Application No.: US14523198
Filing date: 2014-10-24
Applicant: Google Inc.
Inventor: Matthew Sharifi, Ignacio Lopez Moreno, Ludwig Schmidt
CPC classification number: G10L17/02, G10L17/005, G10L17/08, G10L17/18, G10L25/51
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing speaker identification. In some implementations, an utterance vector that is derived from an utterance is obtained. Hash values are determined for the utterance vector according to multiple different hash functions. A set of speaker vectors from a plurality of hash tables is determined using the hash values, where each speaker vector was derived from one or more utterances of a respective speaker. The speaker vectors in the set are compared with the utterance vector. A speaker vector is selected based on comparing the speaker vectors in the set with the utterance vector.
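The lookup described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the random-hyperplane hash family, the cosine comparison, the class name `SpeakerIndex`, and all parameter values are assumptions chosen for illustration only.

```python
import math
import random
from collections import defaultdict


def make_hash_function(dim, n_bits, rng):
    """Build one random-hyperplane hash (an assumed LSH family, not from the abstract)."""
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]

    def hash_vector(vec):
        # One sign bit per hyperplane; the tuple of bits is the hash value.
        return tuple(
            int(sum(p * v for p, v in zip(plane, vec)) >= 0.0) for plane in planes
        )

    return hash_vector


def cosine(a, b):
    """Cosine similarity, used here as the (assumed) comparison measure."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SpeakerIndex:
    """Hypothetical index: one hash table per hash function."""

    def __init__(self, dim, n_tables=8, n_bits=4, seed=0):
        rng = random.Random(seed)
        self.hash_functions = [
            make_hash_function(dim, n_bits, rng) for _ in range(n_tables)
        ]
        self.tables = [defaultdict(list) for _ in range(n_tables)]
        self.vectors = {}

    def add(self, speaker_id, speaker_vector):
        """Store a speaker vector in every table under its hash value."""
        self.vectors[speaker_id] = speaker_vector
        for h, table in zip(self.hash_functions, self.tables):
            table[h(speaker_vector)].append(speaker_id)

    def identify(self, utterance_vector):
        """Collect candidates from all tables, then pick the most similar one."""
        candidates = set()
        for h, table in zip(self.hash_functions, self.tables):
            candidates.update(table.get(h(utterance_vector), []))
        if not candidates:
            return None
        return max(
            candidates, key=lambda s: cosine(self.vectors[s], utterance_vector)
        )


index = SpeakerIndex(dim=3)
index.add("alice", [1.0, 0.0, 0.0])
index.add("bob", [0.0, 1.0, 0.0])
print(index.identify([2.0, 0.0, 0.0]))
```

Only the speaker vectors that share a bucket with the utterance vector in at least one table are compared, which is what keeps the comparison step small relative to a full scan.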
-
Publication No.: US09002835B2
Publication date: 2015-04-07
Application No.: US14047708
Filing date: 2013-10-07
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G06F17/30041, G06F17/30026, G06F17/30029, G06F17/30035, G06F17/30044, G06F17/30401, G06F17/30424, G06F17/30477, G06F17/3053, G06F17/30746, G06F17/30787, G06F17/30867, G06F17/30876, G06Q30/02, G06Q30/0631, G10L25/54
Abstract: Methods, systems, and apparatus for receiving a natural language query of a user, and environmental data, identifying a media item based on the environmental data, determining an entity type based on the natural language query, selecting an entity associated with the media item that matches the entity type, selecting, from a media consumption database that identifies media items that have been indicated as consumed by the user, one or more media items that have been indicated as consumed by the user and that are associated with the selected entity, and providing a response to the query based on selecting the one or more media items that have been indicated as consumed by the user and that are associated with the selected entity.
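The selection steps in this abstract can be sketched as follows. The sketch assumes that the media item has already been identified from the environmental data and that the entity type has already been derived from the natural language query; the function name `answer_query`, the dictionary-based catalog, and the sample titles are hypothetical.

```python
def answer_query(media_title, entity_type, catalog, consumed_titles):
    """Return (selected entity, consumed media items associated with it).

    catalog maps a media title to {entity type: [entity names]} (assumed layout).
    consumed_titles stands in for the user's media consumption database.
    """
    candidates = catalog.get(media_title, {}).get(entity_type, [])
    if not candidates:
        return None, []
    # Assumption: take the first entity of the requested type.
    entity = candidates[0]
    related = [
        title
        for title in consumed_titles
        if entity in catalog.get(title, {}).get(entity_type, [])
    ]
    return entity, related


catalog = {
    "Film A": {"actor": ["Actor X"]},
    "Film B": {"actor": ["Actor X", "Actor Y"]},
    "Film C": {"actor": ["Actor Z"]},
}
# The user is watching "Film A" and asks where they have seen the actor before.
print(answer_query("Film A", "actor", catalog, ["Film B", "Film C"]))
```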
-
Publication No.: US08918382B1
Publication date: 2014-12-23
Application No.: US13889681
Filing date: 2013-05-08
Applicant: Google Inc.
Inventor: Matthew Sharifi, Gheorghe Postelnicu
IPC: G06F17/30
CPC classification number: G06F17/30722, G06F17/30038, G06F17/30371, G06F17/30817
Abstract: This disclosure relates to learning common spelling errors of metadata terms associated with content through content matching, such as content matching using fingerprints.
-
Publication No.: US08838609B1
Publication date: 2014-09-16
Application No.: US13648511
Filing date: 2012-10-10
Applicant: Google Inc.
Inventor: Matthew Sharifi, Gheorghe Postelnicu
CPC classification number: G06F17/3002, G06F17/30784, G06F17/30864, H04N21/2187, H04N21/23418
Abstract: Down scoring overcrowded bands via IDF weighting scores provides a soft way to reduce the effect of common bands from Locality Sensitive Hashing (LSH) processes. An index component indexes live video references of a live streaming infrastructure pathway process in a reference index. A scoring component scores a set of bands with a set of inverse document frequency (IDF) weighting scores in the reference index. A high score is generated for bands that are featured in a small number of references and a low score is generated for bands featured in a high number of references.
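The effect of IDF weighting on LSH bands can be shown with a small sketch. The abstract does not give a formula, so the common form log(N / df) is assumed here, and the function names `idf_weights` and `score_references` as well as the band keys are hypothetical.

```python
import math
from collections import defaultdict


def idf_weights(reference_bands):
    """Compute an IDF weight per band.

    reference_bands maps a reference id to the set of LSH band keys it contains.
    Assumed formula: log(number of references / references containing the band).
    """
    n_refs = len(reference_bands)
    doc_freq = defaultdict(int)
    for bands in reference_bands.values():
        for band in set(bands):
            doc_freq[band] += 1
    return {band: math.log(n_refs / df) for band, df in doc_freq.items()}


def score_references(probe_bands, reference_bands, weights):
    """Score each reference by the summed IDF weight of the bands it shares with the probe."""
    scores = {}
    for ref_id, bands in reference_bands.items():
        shared = set(probe_bands) & set(bands)
        scores[ref_id] = sum(weights[band] for band in shared)
    return scores


references = {
    "stream_a": {"band_x", "band_common"},
    "stream_b": {"band_y", "band_common"},
    "stream_c": {"band_z", "band_common"},
}
weights = idf_weights(references)
print(score_references({"band_x", "band_common"}, references, weights))
```

In this example `band_common` appears in every reference, so its weight is zero and it no longer contributes to any match score, while the rare `band_x` carries the full weight. This is the "soft" down scoring the abstract refers to: overcrowded bands are not discarded, only discounted.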
-
Publication No.: US20140074466A1
Publication date: 2014-03-13
Application No.: US13626439
Filing date: 2012-09-25
Applicant: GOOGLE INC.
Inventor: Matthew Sharifi, Gheorghe Postelnicu
IPC: G10L15/26
CPC classification number: G10L15/22, G06F16/3329, G06F16/3344, G06F16/433, G06F16/686, G10L15/08, G10L15/1815, G10L15/24, G10L15/30, G10L2015/088, G10L2015/223, G10L2015/225
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance and environmental data, obtaining a transcription of the utterance, identifying an entity using the environmental data, submitting a query to a natural language query processing engine, wherein the query includes at least a portion of the transcription and data that identifies the entity, and obtaining one or more results of the query.
-
Publication No.: US10803391B2
Publication date: 2020-10-13
Application No.: US14812877
Filing date: 2015-07-29
Applicant: GOOGLE INC.
Inventor: Matthew Sharifi, David Petrou, Pranav Khaitan
IPC: G06N5/04, G06N20/00, G06K9/62, G06N5/02, G06F40/295, G06F3/0481, G06F16/2457, G06F16/36
Abstract: Systems and methods are provided for a personal entity modeling for computing devices. For example, a computing device comprises at least one processor and memory storing instructions that, when executed by the at least one processor, cause the mobile device to perform operations including identifying a personal entity in content generated for display on the mobile device, generating training examples for the personal entity from the content, and updating an embedding used to model the personal entity using the training examples. The embedding may be used to make predictions regarding the personal entity. For example, the operations may also include predicting an association between a first personal entity displayed on the computing device and a second entity based on the embedding, and providing a recommendation, to be displayed on the computing device, related to the second entity.
-
Publication No.: US20180232127A1
Publication date: 2018-08-16
Application No.: US15433587
Filing date: 2017-02-15
Applicant: Google Inc.
Inventor: Matthew Sharifi, Jakob Nicolaus Foerster
IPC: G06F3/0484, H04L12/58, G06F17/24, G06N99/00
CPC classification number: G06F3/04842, G06F17/24, G06N20/00, G06Q10/107, H04L51/02, H04L51/04, H04L51/22
Abstract: A system and method for grouping and organizing structured responses in a communication application at a computing device. A structured question in a plurality of messages can be detected based on a structured question model trained via machine learning. A structured question can be a question predicted by the structured question model to have a number of possible answers fewer than a threshold. A user interface element, corresponding to the structured question, can include a structured summarization that includes one or more answers to the structured question present in the plurality of messages from the plurality of users, and/or a structured response template in which at least a subset of possible answers are presented and are selectable. A command to include the generated graphical user interface element in a record of the communication session in a graphical user interface corresponding to the communication application.
-
Publication No.: US20180183739A1
Publication date: 2018-06-28
Application No.: US15391074
Filing date: 2016-12-27
Applicant: Google Inc.
Inventor: Jakob Foerster, Matthew Sharifi
CPC classification number: H04L67/306, H04L51/10, H04L51/20, H04L51/32
Abstract: A system and method includes receiving, by a server system from a first user device executing a first instance of a messaging application, a first message for a user of a second user device executing a second instance of the messaging application. The method also includes determining whether the first message includes a first reference to a first media item. The method includes responsive to determining that the first message includes the first reference to the first media item, generating media playlist information identifying the first media item. The method further includes sending the media playlist information identifying the first media item to a content sharing platform, the first media item to be added to a media playlist maintained by the content sharing platform.
-
Publication No.: US09886942B2
Publication date: 2018-02-06
Application No.: US15477360
Filing date: 2017-04-03
Applicant: Google Inc.
Inventor: Matthew Sharifi, Jakob Nicolaus Foerster
CPC classification number: G10L13/043, G06F17/274, G06F17/2775, G10L13/08
Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
-
Publication No.: US09877071B1
Publication date: 2018-01-23
Application No.: US13873821
Filing date: 2013-04-30
Applicant: Google Inc.
Inventor: Matthew Sharifi, Ant Oztaskent, Yaroslav Volovich
CPC classification number: H04N21/442, G06F17/30749, G10L25/51, G10L25/54, G10L25/57, G10L25/81, H04H60/39, H04H60/48, H04H60/58, H04H60/65, H04H60/74, H04H2201/90, H04L65/4076, H04L67/22, H04L67/2804
Abstract: This disclosure relates to systems and methods for proactively determining identification information for a plurality of audio segments within a plurality of broadcast media streams, and providing identification information associated with specific audio portions of a broadcast media stream automatically or upon request.