-
公开(公告)号:US20240112027A1
公开(公告)日:2024-04-04
申请号:US18477546
申请日:2023-09-28
Applicant: Google LLC
Inventor: Yanqi Zhou , Yanping Huang , Yifeng Lu , Andrew M. Dai , Siamak Shakeri , Zhifeng Chen , James Laudon , Quoc V. Le , Da Huang , Nan Du , David Richard So , Daiyi Peng , Yingwei Cui , Jeffrey Adgate Dean , Chang Lan
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing neural architecture search for machine learning models. In one aspect, a method comprises receiving training data for a machine learning, generating a plurality of candidate neural networks for performing the machine learning task, wherein each candidate neural network comprises a plurality of instances of a layer block composed of a plurality of layers, for each candidate neural network, selecting a respective type for each of the plurality of layers from a set of layer types that comprises, training the candidate neural network and evaluating performance scores for the trained candidate neural networks as applied to the machine learning task, and determining a final neural network for performing the machine learning task based at least on the performance scores for the candidate neural networks.
-
公开(公告)号:US20210256390A1
公开(公告)日:2021-08-19
申请号:US17306813
申请日:2021-05-03
Applicant: Google LLC
Inventor: David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le
Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.
-
公开(公告)号:US20240378427A1
公开(公告)日:2024-11-14
申请号:US18661499
申请日:2024-05-10
Applicant: Google LLC
Inventor: Slav Petrov , Yonghui Wu , Andrew M. Dai , David Richard So , Dmitry Lepikhin , Erica Ann Moreira , Gaurav Mishra , Jonathan Hudson Clark , Maxim Krikun , Melvin Jose Johnson Premkumar , Nan Du , Orhan Firat , Rohan Anil , Siamak Shakeri , Xavier Garcia , Yanping Huang , Yong Cheng , Yuanzhong Xu , Yujing Zhang , Zachary Alexander Nado , Eric Jun Jie Ni , Kefan Xiao , Vladimir Feinberg , Jin Young Sohn , Aurko Roy
IPC: G06N3/0475 , G06F40/284
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform any one or more of a variety of machine learning tasks. For example, the neural network can be configured as a generative neural network, e.g., an autoregressive generative neural network.
-
公开(公告)号:US20220383119A1
公开(公告)日:2022-12-01
申请号:US17827362
申请日:2022-05-27
Applicant: Google LLC
Inventor: David Richard So , Quoc V. Le, Jr. , Hanxiao Liu , Wojciech Andrzej Manke , Zihang Dai , Noam M. Shazeer
IPC: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network configured to perform the machine learning task. The attention neural network includes one or more attentions layers that each include a squared ReLU activation layer, a depth-wise convolution layer, or both.
-
公开(公告)号:US20240273371A1
公开(公告)日:2024-08-15
申请号:US18431804
申请日:2024-02-02
Applicant: Google LLC
Inventor: Angelica Chen , David Richard So , David Martin Dohan
IPC: G06N3/086
CPC classification number: G06N3/086
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining an architecture for a neural network configured to perform a machine learning task. In one aspect, a method comprises: receiving training data; searching for a final architecture of the neural network, wherein the searching comprises: maintaining current population data; and repeatedly performing evolutionary architecture search steps comprising: selecting one or more candidate architectures from the current population of candidate architectures defined by the source code included in the current population data; generating an input prompt; processing the input prompt using the language model neural network to generate output source code that defines a plurality of new candidate architectures; and using the plurality of new candidate architectures defined by the output source code to update the current population data.
-
公开(公告)号:US20220383195A1
公开(公告)日:2022-12-01
申请号:US17795087
申请日:2021-02-08
Applicant: Google LLC
Inventor: Chen Liang , David Richard So , Esteban Alberto Real , Quoc V. Le
Abstract: A method for searching for an output machine learning (ML) algorithm to perform an ML task is described. The method includes: receiving a set of training examples and a set of validation examples, and generating a sequence of candidate ML algorithms to perform the task. For each candidate ML algorithm in the sequence, the method includes: setting up one or more training parameters for the candidate ML algorithm by executing a respective candidate setup function, training the candidate ML algorithm by processing the set of training examples using a respective candidate predict function and a respective candidate learn function, and evaluating a performance of the trained candidate ML algorithm by executing the respective candidate predict function on the set of validation examples to determine a performance metric. The method includes selecting a trained candidate ML algorithm with the best performance metric as the output ML algorithm for the task.
-
公开(公告)号:US20220367052A1
公开(公告)日:2022-11-17
申请号:US17745715
申请日:2022-05-16
Applicant: Google LLC
Inventor: Hanxiao Liu , David Richard So , Quoc V. Le , Zihang Dai
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on a network input to generate a network output. In one aspect, one of the systems includes a neural network configured to perform the machine learning task, the neural network including one or more blocks that each include a feedforward spatial transformation unit.
-
公开(公告)号:US20240428071A1
公开(公告)日:2024-12-26
申请号:US18823611
申请日:2024-09-03
Applicant: Google LLC
Inventor: David Richard So , Quoc V. Le , Hanxiao Liu , Wojciech Andrzej Manke , Zihang Dai , Noam M. Shazeer
IPC: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network configured to perform the machine learning task. The attention neural network includes one or more attentions layers that each include a squared ReLU activation layer, a depth-wise convolution layer, or both.
-
公开(公告)号:US20240378441A1
公开(公告)日:2024-11-14
申请号:US18661447
申请日:2024-05-10
Applicant: Google LLC
Inventor: Slav Petrov , Yonghui Wu , Andrew M. Dai , David Richard So , Dmitry Lepikhin , Erica Ann Moreira , Gaurav Mishra , Jonathan Hudson Clark , Maxim Krikun , Melvin Jose Johnson Premkumar , Nan Du , Orhan Firat , Rohan Anil , Siamak Shakeri , Xavier Garcia , Yanping Huang , Yong Cheng , Yuanzhong Xu , Yujing Zhang , Zachary Alexander Nado , Eric Jun Jie Ni , Kefan Xiao , Vladimir Feinberg , Jin Young Sohn , Aurko Roy
IPC: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform any one or more of a variety of machine learning tasks. For example, the neural network can be configured as a generative neural network, e.g., an autoregressive generative neural network.
-
公开(公告)号:US20230274151A1
公开(公告)日:2023-08-31
申请号:US17915796
申请日:2021-03-30
Applicant: Google LLC
Inventor: Zhen Xu , David Richard So , Andrew M. Dai
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for searching for an architecture for a neural network that performs a multi-modal task that requires operating on inputs that each include data from multiple different modalities.
-
-
-
-
-
-
-
-
-