Hotword recognition
    51.
    Granted patent

    Publication No.: US09934783B2

    Publication date: 2018-04-03

    Application No.: US15176482

    Filing date: 2016-06-08

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data corresponding to an utterance, determining that the audio data corresponds to a hotword, generating a hotword audio fingerprint of the audio data that is determined to correspond to the hotword, comparing the hotword audio fingerprint to one or more stored audio fingerprints of audio data that was previously determined to correspond to the hotword, detecting whether the hotword audio fingerprint matches a stored audio fingerprint of audio data that was previously determined to correspond to the hotword based on whether the comparison indicates a similarity between the hotword audio fingerprint and one of the one or more stored audio fingerprints that satisfies a predetermined threshold, and in response to detecting that the hotword audio fingerprint matches a stored audio fingerprint, disabling access to a computing device into which the utterance was spoken.
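
The comparison step in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: fingerprints are modeled as 64-bit integers, similarity is the fraction of agreeing bits, the threshold is 0.95, and the names `similarity` and `ReplayGuard` are hypothetical. The idea shown is that a hotword utterance nearly identical to one heard before is treated as a likely replay, so access is refused.

```python
from dataclasses import dataclass, field
from typing import List

FINGERPRINT_BITS = 64  # assumed fingerprint width, not specified in the abstract


def similarity(a: int, b: int) -> float:
    """Fraction of the FINGERPRINT_BITS bit positions on which a and b agree."""
    mask = (1 << FINGERPRINT_BITS) - 1
    differing = bin((a ^ b) & mask).count("1")
    return 1.0 - differing / FINGERPRINT_BITS


@dataclass
class ReplayGuard:
    """Keeps fingerprints of earlier hotword utterances (hypothetical helper)."""

    threshold: float = 0.95  # assumed value for the predetermined threshold
    stored: List[int] = field(default_factory=list)

    def allow_access(self, fingerprint: int) -> bool:
        """Return False when the fingerprint matches a stored one."""
        is_replay = any(
            similarity(fingerprint, old) >= self.threshold for old in self.stored
        )
        # Assumption: every hotword fingerprint is remembered for later checks,
        # including ones that were just rejected.
        self.stored.append(fingerprint)
        return not is_replay


guard = ReplayGuard()
first_utterance = 0x0123456789ABCDEF
print(guard.allow_access(first_utterance))        # first time heard
print(guard.allow_access(first_utterance ^ 0b1))  # near-identical repeat
```

A live speaker rarely reproduces an utterance bit for bit, which is why a very high similarity is treated here as suspicious rather than reassuring.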

    FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS

    Publication No.: US20170330586A1

    Publication date: 2017-11-16

    Application No.: US15151362

    Filing date: 2016-05-10

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.
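
To make "mapping frequency domain features to logarithmic scaled frequency domain features" concrete, the sketch below pools linearly spaced spectrum bins into logarithmically spaced bands using a fixed weight matrix. This is a hand-built stand-in, not the layer described in the abstract: there the filters belong to a convolutional layer of the network, whereas here the weights are fixed averages. The function names, the band layout, and the choice to skip the DC bin are all assumptions.

```python
from typing import List


def log_band_edges(num_bins: int, num_bands: int) -> List[int]:
    """Return num_bands + 1 bin indices spaced evenly on a log scale in [1, num_bins]."""
    edges: List[int] = []
    for i in range(num_bands + 1):
        edge = round(num_bins ** (i / num_bands))
        # Force strictly increasing edges so that no band is empty.
        if edges and edge <= edges[-1]:
            edge = edges[-1] + 1
        edges.append(edge)
    if edges[-1] > num_bins:
        raise ValueError("too many bands for this number of bins")
    return edges


def log_filterbank(num_bins: int, num_bands: int) -> List[List[float]]:
    """Build one averaging filter (a row of weights) per log-spaced band."""
    edges = log_band_edges(num_bins, num_bands)
    bank = []
    for lo, hi in zip(edges, edges[1:]):
        row = [0.0] * num_bins
        for k in range(lo, hi):  # bin 0 (DC) is never covered
            row[k] = 1.0 / (hi - lo)
        bank.append(row)
    return bank


def apply_filterbank(bank: List[List[float]], spectrum: List[float]) -> List[float]:
    """Weighted sum of the spectrum under each filter."""
    return [sum(w * x for w, x in zip(row, spectrum)) for row in bank]


spectrum = [float(k) for k in range(16)]  # toy magnitude spectrum
print(apply_filterbank(log_filterbank(16, 4), spectrum))
```

With 16 bins and 4 bands, the bands cover bins 1, 2-3, 4-7, and 8-15, so each band is twice as wide as the one before it.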

    ADAPTIVE ARTIFICIAL NEURAL NETWORK SELECTION TECHNIQUES

    Publication No.: US20170277994A1

    Publication date: 2017-09-28

    Application No.: US15082653

    Filing date: 2016-03-28

    Applicant: Google Inc.

    Abstract: Computer-implemented techniques can include obtaining, by a client computing device, a digital media item and a request for a processing task on the digital item and determining a set of operating parameters based on (i) available computing resources at the client computing device and (ii) a condition of a network. Based on the set of operating parameters, the client computing device or a server computing device can select one of a plurality of artificial neural networks (ANNs), each ANN defining which portions of the processing task are to be performed by the client and server computing devices. The client and server computing devices can coordinate processing of the processing task according to the selected ANN. The client computing device can also obtain final processing results corresponding to a final evaluation of the processing task and generate an output based on the final processing results.
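
The selection step can be sketched as follows. The abstract does not say how the operating parameters drive the choice, so this example assumes a simple rule: estimate end-to-end latency for each candidate network and pick the lowest. The classes `SplitNetwork` and `OperatingParameters`, their fields, and all numbers are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SplitNetwork:
    """One candidate ANN and how it divides work between client and server."""

    name: str
    client_flops: float  # work done on the client
    upload_bytes: int    # intermediate data sent to the server
    server_flops: float  # work done on the server


@dataclass(frozen=True)
class OperatingParameters:
    client_flops_per_s: float  # available client compute
    server_flops_per_s: float
    uplink_bytes_per_s: float  # network condition


def estimated_latency(net: SplitNetwork, params: OperatingParameters) -> float:
    """Client time + transfer time + server time, in seconds."""
    if net.upload_bytes == 0:
        transfer = 0.0
    elif params.uplink_bytes_per_s <= 0:
        transfer = math.inf  # offline: anything needing the server is unusable
    else:
        transfer = net.upload_bytes / params.uplink_bytes_per_s
    client = net.client_flops / params.client_flops_per_s
    server = net.server_flops / params.server_flops_per_s
    return client + transfer + server


def select_network(candidates: List[SplitNetwork], params: OperatingParameters) -> SplitNetwork:
    """Assumed policy: choose the candidate with the lowest estimated latency."""
    return min(candidates, key=lambda net: estimated_latency(net, params))


CANDIDATES = [
    SplitNetwork("all_client", client_flops=4e9, upload_bytes=0, server_flops=0.0),
    SplitNetwork("split", client_flops=1e9, upload_bytes=200_000, server_flops=3e9),
    SplitNetwork("all_server", client_flops=0.0, upload_bytes=2_000_000, server_flops=4e9),
]

slow_link = OperatingParameters(1e9, 100e9, uplink_bytes_per_s=50_000)
print(select_network(CANDIDATES, slow_link).name)
```

Other policies, such as minimizing client battery use, would fit the same structure by swapping the scoring function.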

    SYSTEMS AND METHODS FOR LIVE MEDIA CONTENT MATCHING

    Publication No.: US20170257650A1

    Publication date: 2017-09-07

    Application No.: US15603357

    Filing date: 2017-05-23

    Applicant: GOOGLE INC.

    Inventor: Matthew Sharifi

    Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, wherein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.

    ADAPTIVE TEXT-TO-SPEECH OUTPUTS
    59.
    Patent application

    Publication No.: US20170221472A1

    Publication date: 2017-08-03

    Application No.: US15477360

    Filing date: 2017-04-03

    Applicant: Google Inc.

    CPC classification number: G10L13/043 G06F17/274 G06F17/2775 G10L13/08

    Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determine a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generate audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
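
The text-selection step in this abstract might look like the sketch below. It covers only the choice of text segment; speech synthesis is left out because the abstract names no particular engine. The three proficiency levels, the `select_text_segment` function, and the fallback rule are assumptions made for illustration.

```python
from enum import IntEnum
from typing import Dict


class Proficiency(IntEnum):
    """Hypothetical proficiency scale; the abstract does not define one."""

    BEGINNER = 1
    INTERMEDIATE = 2
    FLUENT = 3


def select_text_segment(variants: Dict[Proficiency, str], proficiency: Proficiency) -> str:
    """Pick the most advanced variant that does not exceed the user's level.

    Falls back to the simplest available variant when every variant is
    above the user's level.
    """
    if not variants:
        raise ValueError("at least one text variant is required")
    eligible = [level for level in variants if level <= proficiency]
    if eligible:
        return variants[max(eligible)]
    return variants[min(variants)]


weather = {
    Proficiency.BEGINNER: "It will rain today. Take an umbrella.",
    Proficiency.FLUENT: "Expect intermittent showers this afternoon, clearing by dusk.",
}
print(select_text_segment(weather, Proficiency.INTERMEDIATE))
```

An intermediate user receives the beginner wording here because no intermediate variant exists and the fluent one is above their level.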
