Patent search ap:("Google LLC") AND inv:"William Chan" Page 1

1.

Invention publication
GENERATING VIDEOS USING DIFFUSION MODELS

Status: Under examination (published)

Publication (announcement) number: US20240338936A1

Publication (announcement) date: 2024-10-10

Application number: US18296938

Application date: 2023-04-06

Applicant: Google LLC

Inventor: Jonathan Ho, Tim Salimans, Alexey Alexeevich Gritsenko, William Chan, Mohammad Norouzi, David James Fleet

IPC: G06V10/82, G06V10/771, H04N7/01

CPC classification number: G06V10/82, G06V10/771, H04N7/0117, H04N7/013

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output video conditioned on an input. In one aspect, a method comprises receiving the input; initializing a current intermediate representation; generating an output video by updating the current intermediate representation at each of a plurality of iterations, wherein the updating comprises, at each iteration: processing an intermediate input for the iteration comprising the current intermediate representation using a diffusion model that is configured to process the intermediate input to generate a noise output; and updating the current intermediate representation using the noise output for the iteration.

2.

Invention publication
GENERATING VIDEOS USING SEQUENCES OF GENERATIVE NEURAL NETWORKS

Status: Under examination (published)

Publication (announcement) number: US20240320965A1

Publication (announcement) date: 2024-09-26

Application number: US18400856

Application date: 2023-12-29

Applicant: Google LLC

Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans

IPC: G06V10/82, G06T3/4053

CPC classification number: G06V10/82, G06T3/4053

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.

3.

Granted invention patent
Generating images using sequences of generative neural networks

Status: In force

Publication (announcement) number: US11978141B2

Publication (announcement) date: 2024-05-07

Application number: US18199883

Application date: 2023-05-19

Applicant: Google LLC

Inventor: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho

IPC: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/40, G06T3/4053, G06T5/00

CPC classification number: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/4053, G06T5/002

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

4.

Granted invention patent
Generating videos using sequences of generative neural networks

Status: In force

Publication (announcement) number: US11908180B1

Publication (announcement) date: 2024-02-20

Application number: US18126281

Application date: 2023-03-24

Applicant: Google LLC

Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans

IPC: G06V10/82, G06T3/40

CPC classification number: G06V10/82, G06T3/4053

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.

5.

Invention publication
GENERATING NEURAL NETWORK OUTPUTS USING INSERTION COMMANDS

Status: Under examination (published)

Publication (announcement) number: US20240028893A1

Publication (announcement) date: 2024-01-25

Application number: US18321696

Application date: 2023-05-22

Applicant: Google LLC

Inventor: William Chan, Mitchell Thomas Stern, Nikita Kitaev, Kelvin Gu, Jakob D. Uszkoreit

IPC: G06N3/08, G06F40/237, G06N3/04, G06N3/084

CPC classification number: G06N3/08, G06F40/237, G06N3/04, G06N3/084

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements are arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.

6.

Invention publication
GENERATING IMAGES USING SEQUENCES OF GENERATIVE NEURAL NETWORKS

Status: Under examination (published)

Publication (announcement) number: US20230377226A1

Publication (announcement) date: 2023-11-23

Application number: US18199883

Application date: 2023-05-19

Applicant: Google LLC

Inventor: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho

IPC: G06T11/60, G06T3/40, G06T5/00, G06F40/40, G06F40/284, G06N3/08

CPC classification number: G06T11/60, G06T3/4053, G06T5/002, G06F40/40, G06F40/284, G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

7.

Invention application
Augmentation of Audiographic Images for Improved Machine Learning

Status: In force

Publication (announcement) number: US20220012537A1

Publication (announcement) date: 2022-01-13

Application number: US17487548

Application date: 2021-09-28

Applicant: Google LLC

Inventor: Daniel Sung-Joon Park, Quoc V. Le, William Chan, Ekin Dogus Cubuk, Barret Zoph, Yu Zhang, Chung-Cheng Chiu

IPC: G06K9/62, G10L15/16, G10L15/28, G10L15/06, G10L15/12, G06N20/00

Abstract: Generally, the present disclosure is directed to systems and methods that generate augmented training data for machine-learned models via application of one or more augmentation techniques to audiographic images that visually represent audio signals. In particular, the present disclosure provides a number of novel augmentation operations which can be performed directly upon the audiographic image (e.g., as opposed to the raw audio data) to generate augmented training data that results in improved model performance. As an example, the audiographic images can be or include one or more spectrograms or filter bank sequences.

8.

Granted invention patent
Speech recognition with attention-based recurrent neural networks

Status: In force

Publication (announcement) number: US11151985B2

Publication (announcement) date: 2021-10-19

Application number: US16713298

Application date: 2019-12-13

Applicant: Google LLC

Inventor: William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals, Noam M. Shazeer

IPC: G10L15/16, G06N3/04, G06F40/12, G06F40/197, G10L15/183, G10L15/26, G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for speech recognition. One method includes obtaining an input acoustic sequence, the input acoustic sequence representing an utterance, and the input acoustic sequence comprising a respective acoustic feature representation at each of a first number of time steps, processing the input acoustic sequence using a first neural network to convert the input acoustic sequence into an alternative representation for the input acoustic sequence, processing the alternative representation for the input acoustic sequence using an attention-based Recurrent Neural Network (RNN) to generate, for each position in an output sequence order, a set of substring scores that includes a respective substring score for each substring in a set of substrings; and generating a sequence of substrings that represent a transcription of the utterance.

9.

Invention application
GENERATING NEURAL NETWORK OUTPUTS USING INSERTION OPERATIONS

Status: In force

Publication (announcement) number: US20210019477A1

Publication (announcement) date: 2021-01-21

Application number: US16988551

Application date: 2020-08-07

Applicant: Google LLC

Inventor: Jakob D. Uszkoreit, Mitchell Thomas Stern, Jamie Ryan Kiros, William Chan

IPC: G06F40/44, G06N3/04, G06N3/08, G06N5/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating network outputs using insertion operations.

10.

Granted invention patent
Generating videos using sequences of generative neural networks

Status: In force

Publication (announcement) number: US12277758B2

Publication (announcement) date: 2025-04-15

Application number: US18400856

Application date: 2023-12-29

Applicant: Google LLC

Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans

IPC: G06V10/82, G06T3/4053

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.
