CLASSIFYING DATA OBJECTS
    1.
    发明公开

    公开(公告)号:US20240220527A1

    公开(公告)日:2024-07-04

    申请号:US18606458

    申请日:2024-03-15

    Applicant: Google LLC

    CPC classification number: G06F16/35 G06F16/50

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for classifying data objects. One of the methods includes obtaining data that associates each term in a vocabulary of terms with a respective high-dimensional representation of the term; obtaining classification data for a data object, wherein the classification data includes a respective score for each of a plurality of categories, and wherein each of the categories is associated with a respective category label; computing an aggregate high-dimensional representation for the data object from high-dimensional representations for the category labels associated with the categories and the respective scores; identifying a first term in the vocabulary of terms having a high-dimensional representation that is closest to the aggregate high-dimensional representation; and selecting the first term as a category label for the data object.

    Neural Networks For Speaker Verification
    2.
    发明公开

    公开(公告)号:US20240038245A1

    公开(公告)日:2024-02-01

    申请号:US18485069

    申请日:2023-10-11

    Applicant: Google LLC

    CPC classification number: G10L17/18 G10L17/04 G10L17/02

    Abstract: This document generally describes systems, methods, devices, and other techniques related to speaker verification, including (i) training a neural network for a speaker verification model, (ii) enrolling users at a client device, and (iii) verifying identities of users based on characteristics of the users' voices. Some implementations include a computer-implemented method. The method can include receiving, at a computing device, data that characterizes an utterance of a user of the computing device. A speaker representation can be generated, at the computing device, for the utterance using a neural network on the computing device. The neural network can be trained based on a plurality of training samples that each: (i) include data that characterizes a first utterance and data that characterizes one or more second utterances, and (ii) are labeled as a matching speakers sample or a non-matching speakers sample.

    IMAGE PROCESSING NEURAL NETWORKS WITH DYNAMIC FILTER ACTIVATION

    公开(公告)号:US20220004849A1

    公开(公告)日:2022-01-06

    申请号:US17295561

    申请日:2019-11-20

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using neural networks. One of the methods includes receiving a network input; processing the network input through a gater neural network to generate a gating vector that includes a respective value for each of a plurality of filters; determining, from the gating vector and for each of the plurality of filters, whether the filter is active or inactive; and processing the network input through the main convolutional neural network to generate an image processing output, comprising, for each convolutional layer in the first plurality of convolutional layers: receiving an input feature map for the convolutional layer; and generating an output feature map, the generating comprising: for each filter of the convolutional layer that is inactive: setting the output channel for the filter to have all zero elements.

    Processing and generating sets using recurrent neural networks

    公开(公告)号:US11829860B2

    公开(公告)日:2023-11-28

    申请号:US17679625

    申请日:2022-02-24

    Applicant: Google LLC

    CPC classification number: G06N3/044 G06N3/045

    Abstract: In one aspect, this specification describes a recurrent neural network system implemented by one or more computers that is configured to process input sets to generate neural network outputs for each input set. The input set can be a collection of multiple inputs for which the recurrent neural network should generate the same neural network output regardless of the order in which the inputs are arranged in the collection. The recurrent neural network system can include a read neural network, a process neural network, and a write neural network. In another aspect, this specification describes a system implemented as computer programs on one or more computers in one or more locations that is configured to train a recurrent neural network that receives a neural network input and sequentially emits outputs to generate an output sequence for the neural network input.

    COMPLEX LINEAR PROJECTION FOR ACOUSTIC MODELING

    公开(公告)号:US20200286468A1

    公开(公告)日:2020-09-10

    申请号:US16879322

    申请日:2020-05-20

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using complex linear projection are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The method further includes generating frequency domain data using the audio data. The method further includes processing the frequency domain data using complex linear projection. The method further includes providing the processed frequency domain data to a neural network trained as an acoustic model. The method further includes generating a transcription for the utterance that is determined based at least on output that the neural network provides in response to receiving the processed frequency domain data.

    Generating Natural Language Descriptions of Images

    公开(公告)号:US20200042866A1

    公开(公告)日:2020-02-06

    申请号:US16538712

    申请日:2019-08-12

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.

    LABEL CONSISTENCY FOR IMAGE ANALYSIS
    10.
    发明申请

    公开(公告)号:US20200012905A1

    公开(公告)日:2020-01-09

    申请号:US16576321

    申请日:2019-09-19

    Applicant: Google LLC

    Abstract: Systems and techniques are disclosed for labeling objects within an image. The objects may be labeled by selecting an option from a plurality of options such that each option is a potential label for the object. An option may have an option score associated with. Additionally, a relation score may be calculated for a first option and a second option corresponding to a second object in an image. The relation score may be based on a frequency, probability, or observance corresponding to the co-occurrence of text associated with the first option and the second option in a text corpus such as the World Wide Web. An option may be selected as a label for an object based on a global score calculated based at least on an option score and relation score associated with the option.

Patent Agency Ranking