-
公开(公告)号:US09934783B2
公开(公告)日:2018-04-03
申请号:US15176482
申请日:2016-06-08
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G10L15/22 , G06F21/31 , G06F21/32 , G10L15/02 , G10L15/08 , G10L15/1815 , G10L17/08 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data corresponding to an utterance, determining that the audio data corresponds to a hotword, generating a hotword audio fingerprint of the audio data that is determined to correspond to the hotword, comparing the hotword audio fingerprint to one or more stored audio fingerprints of audio data that was previously determined to correspond to the hotword, detecting whether the hotword audio fingerprint matches a stored audio fingerprint of audio data that was previously determined to correspond to the hotword based on whether the comparison indicates a similarity between the hotword audio fingerprint and one of the one or more stored audio fingerprints that satisfies a predetermined threshold, and in response to detecting that the hotword audio fingerprint matches a stored audio fingerprint, disabling access to a computing device into which the utterance was spoken.
-
公开(公告)号:US09911419B2
公开(公告)日:2018-03-06
申请号:US15463786
申请日:2017-03-20
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G06F3/04842 , G06F3/167 , G10L15/063 , G10L15/08 , G10L15/18 , G10L15/265 , G10L15/30 , G10L2015/0631 , G10L2015/0638 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword models that correspond to the candidate hotword; and providing the identified, pre-computed hotword models to the computing device.
-
53.
公开(公告)号:US20180039608A1
公开(公告)日:2018-02-08
申请号:US15227598
申请日:2016-08-03
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G06F17/24 , G06F17/25 , G06F17/273 , H04L51/02 , H04L51/063 , H04L51/16
Abstract: A computer-executed technique for correcting previously received textual messages based on a later received textual message can include receiving and displaying a first textual message in a user interface. A second textual message is received and, based on this second textual message, the first textual message is analyzed to determine if it includes a correctable error. When the first textual message is determined to include an error, a language model is utilized to determine an incorrect word therein. A corrected word is programmatically determined and is used to automatically replace the incorrect word to generate a corrected first textual message. The user interface is modified such that the corrected first received textual message replaces the first received textual message.
-
公开(公告)号:US09870623B2
公开(公告)日:2018-01-16
申请号:US15154957
申请日:2016-05-14
Applicant: Google Inc.
Inventor: Dominik Roblek , David Petrou , Matthew Sharifi
IPC: G06T7/30 , G06F3/0484 , G06F3/0488 , G06T7/90
CPC classification number: G06T7/30 , G06F3/04842 , G06F3/0488 , G06F17/30047 , G06F17/30867 , G06T7/90
Abstract: Methods and apparatus directed to segmenting content displayed on a computing device into regions. The segmenting of content displayed on the computing device into regions is accomplished via analysis of pixels of a “screenshot image” that captures at least a portion of (e.g., all of) the displayed content. Individual pixels of the screenshot image may be analyzed to determine one or more regions of the screenshot image and to optionally assign a corresponding semantic type to each of the regions. Some implementations are further directed to generating, based on one or more of the regions, interactive content to provide for presentation to the user via the computing device.
-
公开(公告)号:US09836597B2
公开(公告)日:2017-12-05
申请号:US15379094
申请日:2016-12-14
Applicant: GOOGLE INC.
Inventor: Alfred Zalmon Spector , David Petrou , Blaise Aguera-Arcas , Matthew Sharifi
CPC classification number: G06F21/54 , G06F17/30539 , G06F17/30876 , G06F21/6218 , G06F2221/0724 , G06T1/0021 , G06T1/20 , G06T1/60 , G06T11/60
Abstract: Systems and methods prevent or restrict the mining of content on a mobile device. For example, a method may include identifying a mining-restriction mark in low order bits or high order bits in a frame buffer of a mobile device and determining whether the mining-restriction mark prevents mining of content. Mining includes non-transient storage of a copy or derivations of data in the frame buffer. The method may also include preventing the mining of data in the frame buffer when the mining-restriction mark prevents mining.
-
公开(公告)号:US20170330586A1
公开(公告)日:2017-11-16
申请号:US15151362
申请日:2016-05-10
Applicant: Google Inc.
Inventor: Dominik Roblek , Matthew Sharifi
CPC classification number: G10L25/30 , G06F11/0715 , G06F11/079 , G06N3/0454 , G06N3/084 , G06N3/088
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.
-
公开(公告)号:US20170277994A1
公开(公告)日:2017-09-28
申请号:US15082653
申请日:2016-03-28
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G06N3/08 , G06F9/5044 , G06F9/505 , G06F9/5094 , G06N3/0454 , H04L67/42 , Y02D10/22
Abstract: Computer-implemented techniques can include obtaining, by a client computing device, a digital media item and a request for a processing task on the digital item and determining a set of operating parameters based on (i) available computing resources at the client computing device and (ii) a condition of a network. Based on the set of operating parameters, the client computing device or a server computing device can select one of a plurality of artificial neural networks (ANNs), each ANN defining which portions of the processing task are to be performed by the client and server computing devices. The client and server computing devices can coordinate processing of the processing task according to the selected ANN. The client computing device can also obtain final processing results corresponding to a final evaluation of the processing task and generate an output based on the final processing results.
-
公开(公告)号:US20170257650A1
公开(公告)日:2017-09-07
申请号:US15603357
申请日:2017-05-23
Applicant: GOOGLE INC.
Inventor: Matthew Sharifi
IPC: H04N21/235 , H04N21/25 , H04N21/466 , H04N21/84 , H04N21/234
CPC classification number: H04N21/2353 , H04N21/23418 , H04N21/251 , H04N21/4668 , H04N21/84
Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.
-
公开(公告)号:US20170221472A1
公开(公告)日:2017-08-03
申请号:US15477360
申请日:2017-04-03
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G10L13/043 , G06F17/274 , G06F17/2775 , G10L13/08
Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
-
公开(公告)号:US09720955B1
公开(公告)日:2017-08-01
申请号:US15289661
申请日:2016-10-10
Applicant: Google Inc.
Inventor: Jing Cao , Alexa Greenberg , Abhanshu Sharma , Yanchao Su , Nicholas Kong , Muhammad Mohsin , Jacek Jurewicz , Wei Huang , Matthew Sharifi , Benjamin Sidhom
IPC: G06F3/00 , G06F17/30 , G06F3/0488 , G06F3/0482
CPC classification number: H04L51/046 , G06F3/0237 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/30398 , G06F17/30554 , G06F17/30643 , G06F17/30864 , G06F17/30867 , G06F17/30973
Abstract: A computing device is described that includes at least one processor and a memory including instructions that when executed cause the at least one processor to output, for display, a graphical keyboard comprising a plurality of keys, and determine, based on an indication of a selection of one or more keys from the plurality of keys, text of an electronic communication. The instructions, when executed, further cause the at least one processor to identify, based at least in part on the text, a searchable entity or trigger phrase, generate, based on the searchable entity or trigger phrase, a search query, and output, for display, within the graphical keyboard, a graphical indication to indicate that the computing device generated the search query.
-
-
-
-
-
-
-
-
-