-
公开(公告)号:US20240273371A1
公开(公告)日:2024-08-15
申请号:US18431804
申请日:2024-02-02
Applicant: Google LLC
Inventor: Angelica Chen , David Richard So , David Martin Dohan
IPC: G06N3/086
CPC classification number: G06N3/086
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining an architecture for a neural network configured to perform a machine learning task. In one aspect, a method comprises: receiving training data; searching for a final architecture of the neural network, wherein the searching comprises: maintaining current population data; and repeatedly performing evolutionary architecture search steps comprising: selecting one or more candidate architectures from the current population of candidate architectures defined by the source code included in the current population data; generating an input prompt; processing the input prompt using the language model neural network to generate output source code that defines a plurality of new candidate architectures; and using the plurality of new candidate architectures defined by the output source code to update the current population data.
-
公开(公告)号:US10997503B2
公开(公告)日:2021-05-04
申请号:US16447866
申请日:2019-06-20
Applicant: Google LLC
Inventor: David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le
Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.
-
公开(公告)号:US10991074B2
公开(公告)日:2021-04-27
申请号:US16442365
申请日:2019-06-14
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , David Martin Dohan , Dumitru Erhan , Dilip Krishnan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the systems includes a domain transformation neural network implemented by one or more computers, wherein the domain transformation neural network is configured to: receive an input image from a source domain; and process a network input comprising the input image from the source domain to generate a transformed image that is a transformation of the input image from the source domain to a target domain that is different from the source domain.
-
公开(公告)号:US20250094838A1
公开(公告)日:2025-03-20
申请号:US18967327
申请日:2024-12-03
Applicant: Google LLC
Inventor: Jason Weng Wei , Dengyong Zhou , Xuezhi Wang , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Charles Aloysius Sutton , Nathanael Martin Schärli , Nathan Kemp Sekiguchi Scales , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , David Martin Dohan , Aitor Lewkowycz , Jacob Austin , Henryk Michalewski , David Luan , David J. Bieber , Anders Johan Andreassen , Maxwell Isaac Nye
IPC: G06N5/022
Abstract: An example technique for image analysis is provided. An example image analysis method includes obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example image analysis method includes inputting, to a machine-learned model, the instructive sequence and an operative image processing query that comprises image data, wherein the machine-learned model is configured to process the operative query with attention over the instructive sequence. The example method can include generating, using the machine-learned model and responsive to the operative query, an operative image processing response that comprises an analysis of the image data.
-
公开(公告)号:US20240386202A1
公开(公告)日:2024-11-21
申请号:US18653146
申请日:2024-05-02
Applicant: Google LLC
Inventor: Matthew Douglas Hoffman , Charles Aloysius Sutton , David Martin Dohan , Sholto Francis Alexandre Douglas , Tuan Anh Le , Van Du Phan , Aaron Thomas Parisi , Ryan Michael Rifkin , Pavel Sountsov , Sharad Vikram
IPC: G06F40/284
Abstract: Systems and methods for generative language model tuning can include training the generative language model to generate sets of output text tokens with set of intermediary text tokens with training examples that include input and output pairs. The training can include processing the input with the language model to determine a predicted output and a predicted set of intermediary text tokens. The predicted set of intermediary text tokens can then be evaluated based at least in part on the output associated with the input and the predicted output.
-
公开(公告)号:US20230401451A1
公开(公告)日:2023-12-14
申请号:US18199886
申请日:2023-05-19
Applicant: Google LLC
Inventor: Yutian Chen , Xingyou Song , Chansoo Lee , Zi Wang , Qiuyi Zhang , David Martin Dohan , Sagi Perel , Joao Ferdinando Gomes de Freitas
IPC: G06N3/0985 , G06N3/0455
CPC classification number: G06N3/0985 , G06N3/0455
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes receiving metadata for the training, generating a metadata sequence that represents the metadata, at each of a plurality of iterations: generating one or more trials that each specify a respective value for each of a set of hyperparameters, comprising, for each trial: generating an input sequence for the iteration that comprises (i) the metadata sequence and (ii) for any earlier trials, a respective sequence that represents the respective values for the hyperparameters specified by the earlier trial and a measure of performance for the trial, and processing an input sequence for the trial that comprises the input sequence for the iteration using a sequence generation neural network to generate an output sequence that represents respective values for the hyperparameters.
-
公开(公告)号:US20230394328A1
公开(公告)日:2023-12-07
申请号:US17881746
申请日:2022-08-05
Applicant: Google LLC
Inventor: Jason Weng Wei , Dengyong Zhou , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Nathan Kemp Sekiguchi Scales , David J. Bieber , Charles Aloysius Sutton , Nathanael Martin Schärli , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , Aitor Lewkowycz , Jiageng Luan , David Martin Dohan , Henryk Michalewski , Jacob Austin , Anders Johan Andreassen , Maxwell Isaac Nye , Xuezhi Wang
IPC: G06N5/02
CPC classification number: G06N5/022
Abstract: Example embodiments of aspects of the present disclosure provide an example computer-implemented method for improved prompting of a machine-learned model. The example method can include obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example method can include inputting, to a machine-learned model, the instructive sequence and an operative query, wherein the machine-learned model is configured to process the operative query with attention over the instructive sequence. The example method can include generating, using the machine-learned model and responsive to the operative query, an operative response.
-
8.
公开(公告)号:US20230244938A1
公开(公告)日:2023-08-03
申请号:US18160776
申请日:2023-01-27
Applicant: Google LLC
Inventor: Jason Weng Wei , Dengyong Zhou , Xuezhi Wang , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Charles Aloysius Sutton , Nathanael Martin Schärli , Nathan Kemp Sekiguchi Scales , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , David Martin Dohan , Aitor Lewkowycz , Henryk Michalewski , Jiageng Luan , David J. Bieber , Jacob Austin , Anders Johan Andreassen , Maxwell Isaac Nye , Yi Tay , Mostafa Dehghani
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: An example method for pretraining a machine-learned model is provided. The example method includes obtaining a plurality of different combinations of configuration parameters of a pretraining objective framework. The example method includes generating, using the pretraining objective framework, a plurality of corrupted training examples from one or more training examples, wherein the plurality of corrupted training examples are respectively generated according to the plurality of different combinations. The example method includes inputting the plurality of corrupted training examples into the machine-learned model, wherein the machine-learned model is configured to generate uncorrupted subportions corresponding to corrupted subportions of the corrupted training examples. The example method includes obtaining, from the machine-learned model, a plurality of outputs respectively generated by the machine-learned model based on the plurality of corrupted training examples. The example method includes updating one or more parameters of the machine-learned model based on an evaluation of the plurality of outputs.
-
公开(公告)号:US20230083892A1
公开(公告)日:2023-03-16
申请号:US17798024
申请日:2021-02-08
Applicant: Google LLC
Inventor: David Benjamin Belanger , Georgiana Andreea Gane , Christof Angermueller , David W. Sculley, II , David Martin Dohan , Kevin Patrick Murphy , Lucy Colwell , Zelda Elaine Mariet
Abstract: Methods and systems for performing black box optimization to identify an output that optimizes an objective.
-
公开(公告)号:US20210256390A1
公开(公告)日:2021-08-19
申请号:US17306813
申请日:2021-05-03
Applicant: Google LLC
Inventor: David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le
Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.
-
-
-
-
-
-
-
-
-