Frequency based audio analysis using neural networks

    公开(公告)号:US10460747B2

    公开(公告)日:2019-10-29

    申请号:US15151362

    申请日:2016-05-10

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.

    Audio data classification
    202.
    发明授权

    公开(公告)号:US10424321B1

    公开(公告)日:2019-09-24

    申请号:US13932158

    申请日:2013-07-01

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for analyzing an audio sample to determine whether the audio sample includes music audio data. One or more detectors, including a spectral fluctuation detector, a peak repetition detector, and a beat pitch detector, may analyze the audio sample and generate a score that represents whether the audio sample includes music audio data. One or more of the scores may be combined to determine whether the audio sample includes music audio data or non-music audio data.

    AUTO-ADJUST PLAYBACK SPEED AND CONTEXTUAL INFORMATION

    公开(公告)号:US20190268632A1

    公开(公告)日:2019-08-29

    申请号:US15908481

    申请日:2018-02-28

    Applicant: Google LLC

    Abstract: Implementations disclose methods and systems for providing a media item at an adjusted playback. A method includes receiving, from a first user device, a playback request from a first user for a first media item including one or more portions of media content; determining an adjusted playback for at least one portion of the first media item that is different than a default playback for the at least one portion of the first media item. The determining is based on previous playback behavior of one or more users in relation to one or more media items that each included one or more portions of media content corresponding to the one or more portions media content of the first media item; and causing the at least one portion of the first media item to be rendered on the first user device at the adjusted playback.

    Techniques for suggesting electronic messages based on user activity and other context

    公开(公告)号:US10387461B2

    公开(公告)日:2019-08-20

    申请号:US15238304

    申请日:2016-08-16

    Applicant: Google LLC

    Abstract: A computer-implemented technique can include detecting an initiation of composing an electronic message by a user, obtaining contextual information for the electronic message from a source external to a text of the electronic message, obtaining a first suggestion for the text of the electronic message based on the contextual information, detecting an operating condition indicative of a user activity during which the user is likely to experience difficulty in typing, in response to detecting the operating condition, obtaining a second suggestion for the electronic message based on the contextual information, the second suggestion being more detailed than the first suggestion, and outputting one of the first and second suggestions depending on one or more other conditions.

    On-device query rewriting
    206.
    发明授权

    公开(公告)号:US10346493B1

    公开(公告)日:2019-07-09

    申请号:US15595019

    申请日:2017-05-15

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, relating to on-device query annotating. In some implementations, a search query is received, and a mobile device identifies a reference to a particular entity and a reference to a category based on the query. A model that is stored on the mobile device and stores one or more facts that are associated with one or more entities is accessed. A subset of facts from among the facts that are stored in the model for the particular entity is selected. The search query is annotated based at least on one or more facts of the subset of facts that are stored in the model for the particular entity. The annotated search query is transmitted, from the mobile device to a search engine, for processing. A result of processing the annotated search query is received by the mobile device.

    Hotword recognition
    208.
    发明授权

    公开(公告)号:US10262659B2

    公开(公告)日:2019-04-16

    申请号:US15909519

    申请日:2018-03-01

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data corresponding to an utterance, determining that the audio data corresponds to a hotword, generating a hotword audio fingerprint of the audio data that is determined to correspond to the hotword, comparing the hotword audio fingerprint to one or more stored audio fingerprints of audio data that was previously determined to correspond to the hotword, detecting whether the hotword audio fingerprint matches a stored audio fingerprint of audio data that was previously determined to correspond to the hotword based on whether the comparison indicates a similarity between the hotword audio fingerprint and one of the one or more stored audio fingerprints that satisfies a predetermined threshold, and in response to detecting that the hotword audio fingerprint matches a stored audio fingerprint, disabling access to a computing device into which the utterance was spoken.

    SEGMENT-BASED SPEAKER VERIFICATION USING DYNAMICALLY GENERATED PHRASES

    公开(公告)号:US20180308492A1

    公开(公告)日:2018-10-25

    申请号:US16017690

    申请日:2018-06-25

    Applicant: Google LLC

    CPC classification number: G10L17/24 G10L15/02 G10L17/04 G10L2015/025

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.

Patent Agency Ranking