Using Hierarchical Representations for Neural Network Architecture Searching

    公开(公告)号:US20200293899A1

    公开(公告)日:2020-09-17

    申请号:US16759567

    申请日:2018-10-26

    Abstract: A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures, and evaluating their fitness. The modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.

    Processing text sequences using neural networks

    公开(公告)号:US10733390B2

    公开(公告)日:2020-08-04

    申请号:US16434459

    申请日:2019-06-07

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language modeling. In one aspect, a system comprises: a masked convolutional decoder neural network that comprises a plurality of masked convolutional neural network layers and is configured to generate a respective probability distribution over a set of possible target embeddings at each of a plurality of time steps; and a modeling engine that is configured to use the respective probability distribution generated by the decoder neural network at each of the plurality of time steps to estimate a probability that a string represented by the target embeddings corresponding to the plurality of time steps belongs to the natural language.

    PROCESSING TEXT SEQUENCES USING NEURAL NETWORKS

    公开(公告)号:US20190286708A1

    公开(公告)日:2019-09-19

    申请号:US16434459

    申请日:2019-06-07

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural machine translation. In one aspect, a system is configured to receive an input sequence of source embeddings representing a source sequence of words in a source natural language and to generate an output sequence of target embeddings representing a target sequence of words that is a translation of the source sequence into a target natural language, the system comprising: a dilated convolutional neural network configured to process the input sequence of source embeddings to generate an encoded representation of the source sequence, and a masked dilated convolutional neural network configured to process the encoded representation of the source sequence to generate the output sequence of target embeddings.

    GENERATING AUDIO USING NEURAL NETWORKS
    45.
    发明申请

    公开(公告)号:US20190251987A1

    公开(公告)日:2019-08-15

    申请号:US16390549

    申请日:2019-04-22

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of audio data that comprises a respective audio sample at each of a plurality of time steps. One of the methods includes, for each of the time steps: providing a current sequence of audio data as input to a convolutional subnetwork, wherein the current sequence comprises the respective audio sample at each time step that precedes the time step in the output sequence, and wherein the convolutional subnetwork is configured to process the current sequence of audio data to generate an alternative representation for the time step; and providing the alternative representation for the time step as input to an output layer, wherein the output layer is configured to: process the alternative representation to generate an output that defines a score distribution over a plurality of possible audio samples for the time step.

    PROCESSING TEXT SEQUENCES USING NEURAL NETWORKS

    公开(公告)号:US20180329897A1

    公开(公告)日:2018-11-15

    申请号:US16032971

    申请日:2018-07-11

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural machine translation. In one aspect, a system is configured to receive an input sequence of source embeddings representing a source sequence of words in a source natural language and to generate an output sequence of target embeddings representing a target sequence of words that is a translation of the source sequence into a target natural language, the system comprising: a dilated convolutional neural network configured to process the input sequence of source embeddings to generate an encoded representation of the source sequence, and a masked dilated convolutional neural network configured to process the encoded representation of the source sequence to generate the output sequence of target embeddings.

    Spatial transformer modules
    47.
    发明授权

    公开(公告)号:US10032089B2

    公开(公告)日:2018-07-24

    申请号:US15174133

    申请日:2016-06-06

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using an image processing neural network system that includes a spatial transformer module. One of the methods includes receiving an input feature map derived from the one or more input images, and applying a spatial transformation to the input feature map to generate a transformed feature map, comprising: processing the input feature map to generate spatial transformation parameters for the spatial transformation, and sampling from the input feature map in accordance with the spatial transformation parameters to generate the transformed feature map.

    Using Hierarchical Representations for Neural Network Architecture Searching

    公开(公告)号:US20240249146A1

    公开(公告)日:2024-07-25

    申请号:US18415376

    申请日:2024-01-17

    CPC classification number: G06N3/086 G06F16/9024 G06N3/045 G06F17/15

    Abstract: A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures, and evaluating their fitness. The modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.

    Large scale generative neural network model with inference for representation learning using adversarial training

    公开(公告)号:US11875269B2

    公开(公告)日:2024-01-16

    申请号:US16882352

    申请日:2020-05-22

    CPC classification number: G06N3/088 G06N3/045

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a generator neural network and an encoder neural network. The generator neural network generates, based on a set of latent values, data items which are samples of a distribution. The encoder neural network generates a set of latent values for a respective data item. The training method comprises jointly training the generator neural network, the encoder neural network and a discriminator neural network configured to distinguish between samples generated by the generator network and samples of the distribution which are not generated by the generator network. The discriminator neural network is configured to distinguish by processing, by the discriminator neural network, an input pair comprising a sample part and a latent part.

Patent Agency Ranking