Patent search ap:("Google LLC") AND inv:"David Martin Dohan" Page 1

1.

发明公开
CODE-LEVEL NEURAL ARCHITECTURE SEARCH USING LANGUAGE MODELS 审中-公开

公开(公告)号：US20240273371A1

公开(公告)日：2024-08-15

申请号：US18431804

申请日：2024-02-02

Applicant: Google LLC

Inventor： Angelica Chen , David Richard So , David Martin Dohan

IPC: G06N3/086

CPC classification number: G06N3/086

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining an architecture for a neural network configured to perform a machine learning task. In one aspect, a method comprises: receiving training data; searching for a final architecture of the neural network, wherein the searching comprises: maintaining current population data; and repeatedly performing evolutionary architecture search steps comprising: selecting one or more candidate architectures from the current population of candidate architectures defined by the source code included in the current population data; generating an input prompt; processing the input prompt using the language model neural network to generate output source code that defines a plurality of new candidate architectures; and using the plurality of new candidate architectures defined by the output source code to update the current population data.

2.

发明授权
Computationally efficient neural network architecture search 有权

公开(公告)号：US10997503B2

公开(公告)日：2021-05-04

申请号：US16447866

申请日：2019-06-20

Applicant: Google LLC

Inventor： David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le

IPC: G06N3/08 , G06N3/04

Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.

3.

发明授权
Transforming source domain images into target domain images 有权

公开(公告)号：US10991074B2

公开(公告)日：2021-04-27

申请号：US16442365

申请日：2019-06-14

Applicant: Google LLC

Inventor： Konstantinos Bousmalis , Nathan Silberman , David Martin Dohan , Dumitru Erhan , Dilip Krishnan

IPC: G06T3/40 , G06N20/10 , G06N3/08 , G06K9/62 , G06N3/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the systems includes a domain transformation neural network implemented by one or more computers, wherein the domain transformation neural network is configured to: receive an input image from a source domain; and process a network input comprising the input image from the source domain to generate a transformed image that is a transformation of the input image from the source domain to a target domain that is different from the source domain.

4.

发明申请
Image Analysis by Prompting of Machine-Learned Models Using Chain of Thought 有权

公开(公告)号：US20250094838A1

公开(公告)日：2025-03-20

申请号：US18967327

申请日：2024-12-03

Applicant: Google LLC

Inventor： Jason Weng Wei , Dengyong Zhou , Xuezhi Wang , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Charles Aloysius Sutton , Nathanael Martin Schärli , Nathan Kemp Sekiguchi Scales , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , David Martin Dohan , Aitor Lewkowycz , Jacob Austin , Henryk Michalewski , David Luan , David J. Bieber , Anders Johan Andreassen , Maxwell Isaac Nye

IPC: G06N5/022

Abstract: An example technique for image analysis is provided. An example image analysis method includes obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example image analysis method includes inputting, to a machine-learned model, the instructive sequence and an operative image processing query that comprises image data, wherein the machine-learned model is configured to process the operative query with attention over the instructive sequence. The example method can include generating, using the machine-learned model and responsive to the operative query, an operative image processing response that comprises an analysis of the image data.

5.

发明申请
TUNING GENERATIVE MODELS USING LATENT-VARIABLE INFERENCE 有权

公开(公告)号：US20240386202A1

公开(公告)日：2024-11-21

申请号：US18653146

申请日：2024-05-02

Applicant: Google LLC

Inventor： Matthew Douglas Hoffman , Charles Aloysius Sutton , David Martin Dohan , Sholto Francis Alexandre Douglas , Tuan Anh Le , Van Du Phan , Aaron Thomas Parisi , Ryan Michael Rifkin , Pavel Sountsov , Sharad Vikram

IPC: G06F40/284

Abstract: Systems and methods for generative language model tuning can include training the generative language model to generate sets of output text tokens with set of intermediary text tokens with training examples that include input and output pairs. The training can include processing the input with the language model to determine a predicted output and a predicted set of intermediary text tokens. The predicted set of intermediary text tokens can then be evaluated based at least in part on the output associated with the input and the predicted output.

6.

发明公开
DETERMINING HYPERPARAMETERS USING SEQUENCE GENERATION NEURAL NETWORKS 审中-公开

公开(公告)号：US20230401451A1

公开(公告)日：2023-12-14

申请号：US18199886

申请日：2023-05-19

Applicant: Google LLC

Inventor： Yutian Chen , Xingyou Song , Chansoo Lee , Zi Wang , Qiuyi Zhang , David Martin Dohan , Sagi Perel , Joao Ferdinando Gomes de Freitas

IPC: G06N3/0985 , G06N3/0455

CPC classification number: G06N3/0985 , G06N3/0455

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes receiving metadata for the training, generating a metadata sequence that represents the metadata, at each of a plurality of iterations: generating one or more trials that each specify a respective value for each of a set of hyperparameters, comprising, for each trial: generating an input sequence for the iteration that comprises (i) the metadata sequence and (ii) for any earlier trials, a respective sequence that represents the respective values for the hyperparameters specified by the earlier trial and a measure of performance for the trial, and processing an input sequence for the trial that comprises the input sequence for the iteration using a sequence generation neural network to generate an output sequence that represents respective values for the hyperparameters.

7.

发明公开
Prompting Machine-Learned Models Using Chains of Thought 审中-公开

公开(公告)号：US20230394328A1

公开(公告)日：2023-12-07

申请号：US17881746

申请日：2022-08-05

Applicant: Google LLC

Inventor： Jason Weng Wei , Dengyong Zhou , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Nathan Kemp Sekiguchi Scales , David J. Bieber , Charles Aloysius Sutton , Nathanael Martin Schärli , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , Aitor Lewkowycz , Jiageng Luan , David Martin Dohan , Henryk Michalewski , Jacob Austin , Anders Johan Andreassen , Maxwell Isaac Nye , Xuezhi Wang

IPC: G06N5/02

CPC classification number: G06N5/022

Abstract: Example embodiments of aspects of the present disclosure provide an example computer-implemented method for improved prompting of a machine-learned model. The example method can include obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example method can include inputting, to a machine-learned model, the instructive sequence and an operative query, wherein the machine-learned model is configured to process the operative query with attention over the instructive sequence. The example method can include generating, using the machine-learned model and responsive to the operative query, an operative response.

8.

发明公开
Using Chains of Thought to Prompt Machine-Learned Models Pre-Trained on Diversified Objectives 审中-公开

公开(公告)号：US20230244938A1

公开(公告)日：2023-08-03

申请号：US18160776

申请日：2023-01-27

Applicant: Google LLC

Inventor： Jason Weng Wei , Dengyong Zhou , Xuezhi Wang , Dale Eric Schuurmans , Quoc V. Le , Maarten Paul Bosma , Ed Huai-Hsin Chi , Olivier Jean Andrè Bousquet , Le Hou , Charles Aloysius Sutton , Nathanael Martin Schärli , Nathan Kemp Sekiguchi Scales , Augustus Quadrozzi Odena , Sharan Ajit Narang , Guy Gur-Ari Krakover , Aakanksha Chowdhery , David Martin Dohan , Aitor Lewkowycz , Henryk Michalewski , Jiageng Luan , David J. Bieber , Jacob Austin , Anders Johan Andreassen , Maxwell Isaac Nye , Yi Tay , Mostafa Dehghani

IPC: G06N3/08

CPC classification number: G06N3/08

Abstract: An example method for pretraining a machine-learned model is provided. The example method includes obtaining a plurality of different combinations of configuration parameters of a pretraining objective framework. The example method includes generating, using the pretraining objective framework, a plurality of corrupted training examples from one or more training examples, wherein the plurality of corrupted training examples are respectively generated according to the plurality of different combinations. The example method includes inputting the plurality of corrupted training examples into the machine-learned model, wherein the machine-learned model is configured to generate uncorrupted subportions corresponding to corrupted subportions of the corrupted training examples. The example method includes obtaining, from the machine-learned model, a plurality of outputs respectively generated by the machine-learned model based on the plurality of corrupted training examples. The example method includes updating one or more parameters of the machine-learned model based on an evaluation of the plurality of outputs.

9.

发明申请
POPULATION-BASED BLACK-BOX OPTIMIZATION 有权

公开(公告)号：US20230083892A1

公开(公告)日：2023-03-16

申请号：US17798024

申请日：2021-02-08

Applicant: Google LLC

Inventor： David Benjamin Belanger , Georgiana Andreea Gane , Christof Angermueller , David W. Sculley, II , David Martin Dohan , Kevin Patrick Murphy , Lucy Colwell , Zelda Elaine Mariet

IPC: G06Q40/02 , G06K9/62

Abstract: Methods and systems for performing black box optimization to identify an output that optimizes an objective.

10.

发明申请
COMPUTATIONALLY EFFICIENT NEURAL NETWORK ARCHITECTURE SEARCH 有权

公开(公告)号：US20210256390A1

公开(公告)日：2021-08-19

申请号：US17306813

申请日：2021-05-03

Applicant: Google LLC

Inventor： David Martin Dohan , David Richard So , Chen Liang , Quoc V. Le

IPC: G06N3/08 , G06N3/04

Abstract: A method for receiving training data for training a neural network to perform a machine learning task and for searching for, using the training data, an optimized neural network architecture for performing the machine learning task is described. Searching for the optimized neural network architecture includes: maintaining population data; maintaining threshold data; and repeatedly performing the following operations: selecting one or more candidate architectures from the population data; generating a new architecture from the one or more selected candidate architectures; for the new architecture: training a neural network having the new architecture until termination criteria for the training are satisfied; and determining a final measure of fitness of the neural network having the new architecture after the training; and adding data defining the new architecture and the final measure of fitness for the neural network having the new architecture to the population data.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification