Patent search ap:("Google LLC") AND inv:"Chitwan Saharia" Page 1

1.

Granted invention patent
Image enhancement via iterative refinement based on machine learning models
Legal status: In force

Publication No.: US12165289B2

Publication date: 2024-12-10

Application No.: US18227120

Filing date: 2023-07-27

Applicant: Google LLC

Inventor: Chitwan Saharia , Jonathan Ho , William Chan , Tim Salimans , David Fleet , Mohammad Norouzi

IPC: G06T5/70 , G06N3/045 , G06N3/08 , G06T3/4007 , G06T5/50

Abstract: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.
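The forward Gaussian diffusion process and the reverse Markov chain named in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the linear noise schedule, the noise-prediction training target, the DDPM-style update rule, and all function names. Images are represented as flat lists of floats so the example needs only the standard library.

```python
import math
import random

def make_alpha_bars(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for an assumed linear noise schedule."""
    alpha_bars, running = [], 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / max(num_steps - 1, 1)
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars

def forward_diffuse(target, alpha_bar, rng):
    """Forward process: return sqrt(a) * y0 + sqrt(1 - a) * eps and the noise eps."""
    noise = [rng.gauss(0.0, 1.0) for _ in target]
    noisy = [math.sqrt(alpha_bar) * y + math.sqrt(1.0 - alpha_bar) * e
             for y, e in zip(target, noise)]
    return noisy, noise

def training_example(source, target, alpha_bars, rng):
    """Build one (network input, regression label) pair from an image pair."""
    t = rng.randrange(len(alpha_bars))
    noisy, noise = forward_diffuse(target, alpha_bars[t], rng)
    # Assumption: the network sees the input image, the noised target and the
    # noise level, and is trained to predict the noise that was added.
    inputs = {"source": source, "noisy_target": noisy, "noise_level": alpha_bars[t]}
    return inputs, noise

def reverse_step(noisy, predicted_noise, t, alpha_bars, rng):
    """One step of the reverse Markov chain (assumed DDPM-style update)."""
    a_bar = alpha_bars[t]
    a_bar_prev = alpha_bars[t - 1] if t > 0 else 1.0
    alpha = a_bar / a_bar_prev
    mean = [(y - (1.0 - alpha) / math.sqrt(1.0 - a_bar) * e) / math.sqrt(alpha)
            for y, e in zip(noisy, predicted_noise)]
    if t == 0:
        return mean  # no noise is added on the final step
    sigma = math.sqrt(1.0 - alpha)
    return [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
```

At inference time, a sketch like this would start from pure Gaussian noise and call `reverse_step` from the last step down to step 0, passing the trained network's noise prediction each time.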

2.

Published invention application
GENERATING IMAGES USING SEQUENCES OF GENERATIVE NEURAL NETWORKS
Legal status: Under examination (published)

Publication No.: US20240249456A1

Publication date: 2024-07-25

Application No.: US18624960

Filing date: 2024-04-02

Applicant: Google LLC

Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70

CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
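The data flow in this abstract (text tokens, then contextual embeddings, then a sequence of generative networks) can be sketched as a simple pipeline. The three `toy_*` functions are hypothetical stand-ins rather than real models, and the choice to make the later stages upsample the previous image is an assumption suggested only by the super-resolution classification code, not stated in the abstract.

```python
from typing import Callable, List, Optional, Sequence

Embedding = List[float]
Image = List[List[float]]  # toy grayscale image stored as rows of pixels
Generator = Callable[[List[Embedding], Optional[Image]], Image]

def generate_image(prompt_tokens: Sequence[str],
                   text_encoder: Callable[[Sequence[str]], List[Embedding]],
                   generators: Sequence[Generator]) -> Optional[Image]:
    """Encode the prompt once, then pass the result through each generator in turn."""
    embeddings = text_encoder(prompt_tokens)
    image: Optional[Image] = None
    for generator in generators:
        # Each stage is conditioned on the embeddings and on the previous output.
        image = generator(embeddings, image)
    return image

def toy_text_encoder(tokens: Sequence[str]) -> List[Embedding]:
    # Hypothetical stand-in: one 1-dimensional "embedding" per token.
    return [[float(len(tok))] for tok in tokens]

def toy_base_generator(embeddings: List[Embedding], previous: Optional[Image]) -> Image:
    # Ignores `previous`; emits a 2x2 image filled with the mean embedding value.
    value = sum(e[0] for e in embeddings) / len(embeddings)
    return [[value, value], [value, value]]

def toy_upsampler(embeddings: List[Embedding], previous: Optional[Image]) -> Image:
    # Doubles the resolution by nearest-neighbour repetition (assumed behaviour).
    assert previous is not None
    out: Image = []
    for row in previous:
        wide = [p for p in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out
```

Calling `generate_image(["a", "red", "cube"], toy_text_encoder, [toy_base_generator, toy_upsampler, toy_upsampler])` runs the base stage followed by two upsampling stages.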

3.

Invention application
Image Enhancement via Iterative Refinement based on Machine Learning Models
Legal status: In force

Publication No.: US20250061551A1

Publication date: 2025-02-20

Application No.: US18939994

Filing date: 2024-11-07

Applicant: Google LLC

Inventor: Chitwan Saharia , Jonathan Ho , William Chan , Tim Salimans , David Fleet , Mohammad Norouzi

IPC: G06T5/70 , G06N3/045 , G06N3/08 , G06T3/4007 , G06T5/50

Abstract: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.

4.

Published invention application
Image Enhancement via Iterative Refinement based on Machine Learning Models
Legal status: Under examination (published)

Publication No.: US20230153959A1

Publication date: 2023-05-18

Application No.: US18155420

Filing date: 2023-01-17

Applicant: Google LLC

Inventor: Chitwan Saharia , Jonathan Ho , William Chan , Tim Salimans , David Fleet , Mohammad Norouzi

IPC: G06T5/00 , G06N3/08 , G06N3/045 , G06T5/50 , G06T3/40

CPC classification number: G06T5/002 , G06N3/08 , G06N3/045 , G06T5/50 , G06T3/4007 , G06T2207/20081 , G06T2207/20016 , G06T2207/20084

Abstract: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.

5.

Invention application
SEQUENCE MODELING USING IMPUTATION
Legal status: In force

Publication No.: US20230075716A1

Publication date: 2023-03-09

Application No.: US17797872

Filing date: 2021-02-08

Applicant: Google LLC

Inventor: William Chan , Chitwan Saharia , Geoffrey E. Hinton , Mohammad Norouzi , Navdeep Jaitly

IPC: G06F40/47 , G06F40/284

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sequence modeling. One of the methods includes receiving an input sequence having a plurality of input positions; determining a plurality of blocks of consecutive input positions; processing the input sequence using a neural network to generate a latent alignment, comprising, at each of a plurality of input time steps: receiving a partial latent alignment from a previous input time step; selecting an input position in each block, wherein the token at the selected input position of the partial latent alignment in each block is a mask token; and processing the partial latent alignment and the input sequence using the neural network to generate a new latent alignment, wherein the new latent alignment comprises, at the selected input position in each block, an output token or a blank token; and generating, using the latent alignment, an output sequence.
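The block-wise decoding loop described in this abstract can be sketched as follows. Several details are assumptions made for illustration: the rule that picks the first masked position in each block, the CTC-style collapse used to turn the alignment into an output sequence, and the `model` callable, which is a hypothetical stand-in for the neural network.

```python
MASK, BLANK = "<mask>", "<blank>"

def make_blocks(num_positions, block_size):
    """Split positions 0..num_positions-1 into blocks of consecutive positions."""
    return [list(range(start, min(start + block_size, num_positions)))
            for start in range(0, num_positions, block_size)]

def impute_alignment(inputs, block_size, model):
    """Fill a fully masked alignment, one position per block per time step."""
    alignment = [MASK] * len(inputs)
    blocks = make_blocks(len(inputs), block_size)
    for _ in range(block_size):
        selected = []
        for block in blocks:
            masked = [p for p in block if alignment[p] == MASK]
            if masked:
                # Assumption: take the first masked position in the block.
                selected.append(masked[0])
        if not selected:
            break
        # The model sees the inputs and the partial alignment, and returns an
        # output token or BLANK for every selected position.
        predictions = model(inputs, alignment, selected)
        for p in selected:
            alignment[p] = predictions[p]
    return alignment

def collapse(alignment):
    """Assumed CTC-style collapse: merge repeated tokens, then drop blanks."""
    output, previous = [], None
    for token in alignment:
        if token != previous and token != BLANK:
            output.append(token)
        previous = token
    return output

def toy_model(inputs, alignment, selected):
    # Hypothetical stand-in: copy the input symbol, treating "_" as blank.
    return {p: (inputs[p] if inputs[p] != "_" else BLANK) for p in selected}
```

Because one position per block is filled at each step, the loop finishes in at most `block_size` steps regardless of the sequence length.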

6.

Published invention application
GENERATING VIDEOS USING SEQUENCES OF GENERATIVE NEURAL NETWORKS
Legal status: Under examination (published)

Publication No.: US20240320965A1

Publication date: 2024-09-26

Application No.: US18400856

Filing date: 2023-12-29

Applicant: Google LLC

Inventor: Jonathan Ho , William Chan , Chitwan Saharia , Jay Ha Whang , Tim Salimans

IPC: G06V10/82 , G06T3/4053

CPC classification number: G06V10/82 , G06T3/4053

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.

7.

Granted invention patent
Generating images using sequences of generative neural networks
Legal status: In force

Publication No.: US11978141B2

Publication date: 2024-05-07

Application No.: US18199883

Filing date: 2023-05-19

Applicant: Google LLC

Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/40 , G06T3/4053 , G06T5/00

CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/002

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

8.

Granted invention patent
Generating videos using sequences of generative neural networks
Legal status: In force

Publication No.: US11908180B1

Publication date: 2024-02-20

Application No.: US18126281

Filing date: 2023-03-24

Applicant: Google LLC

Inventor: Jonathan Ho , William Chan , Chitwan Saharia , Jay Ha Whang , Tim Salimans

IPC: G06V10/82 , G06T3/40

CPC classification number: G06V10/82 , G06T3/4053

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.

9.

Published invention application
GENERATING IMAGES USING SEQUENCES OF GENERATIVE NEURAL NETWORKS
Legal status: Under examination (published)

Publication No.: US20230377226A1

Publication date: 2023-11-23

Application No.: US18199883

Filing date: 2023-05-19

Applicant: Google LLC

Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06T3/40 , G06T5/00 , G06F40/40 , G06F40/284 , G06N3/08

CPC classification number: G06T11/60 , G06T3/4053 , G06T5/002 , G06F40/40 , G06F40/284 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

10.

Granted invention patent
Sequence modeling using imputation
Legal status: In force

Publication No.: US12242818B2

Publication date: 2025-03-04

Application No.: US17797872

Filing date: 2021-02-08

Applicant: Google LLC

Inventor: William Chan , Chitwan Saharia , Geoffrey E. Hinton , Mohammad Norouzi , Navdeep Jaitly

IPC: G06F40/47 , G06F40/284

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sequence modeling. One of the methods includes receiving an input sequence having a plurality of input positions; determining a plurality of blocks of consecutive input positions; processing the input sequence using a neural network to generate a latent alignment, comprising, at each of a plurality of input time steps: receiving a partial latent alignment from a previous input time step; selecting an input position in each block, wherein the token at the selected input position of the partial latent alignment in each block is a mask token; and processing the partial latent alignment and the input sequence using the neural network to generate a new latent alignment, wherein the new latent alignment comprises, at the selected input position in each block, an output token or a blank token; and generating, using the latent alignment, an output sequence.
