-
Publication No.: US12277758B2
Publication date: 2025-04-15
Application No.: US18400856
Filing date: 2023-12-29
Applicant: Google LLC
Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
IPC: G06V10/82, G06T3/4053
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.
-
Publication No.: US20240249456A1
Publication date: 2024-07-25
Application No.: US18624960
Filing date: 2024-04-02
Applicant: Google LLC
Inventor: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho
IPC: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/4053, G06T5/70
CPC classification number: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/4053, G06T5/70
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-
Publication No.: US20240320965A1
Publication date: 2024-09-26
Application No.: US18400856
Filing date: 2023-12-29
Applicant: Google LLC
Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
IPC: G06V10/82, G06T3/4053
CPC classification number: G06V10/82, G06T3/4053
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.
-
Publication No.: US11978141B2
Publication date: 2024-05-07
Application No.: US18199883
Filing date: 2023-05-19
Applicant: Google LLC
Inventor: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho
IPC: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/40, G06T3/4053, G06T5/00
CPC classification number: G06T11/60, G06F40/284, G06F40/40, G06N3/08, G06T3/4053, G06T5/002
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-
Publication No.: US11908180B1
Publication date: 2024-02-20
Application No.: US18126281
Filing date: 2023-03-24
Applicant: Google LLC
Inventor: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
CPC classification number: G06V10/82, G06T3/4053
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.
-
Publication No.: US20230377226A1
Publication date: 2023-11-23
Application No.: US18199883
Filing date: 2023-05-19
Applicant: Google LLC
Inventor: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho
CPC classification number: G06T11/60, G06T3/4053, G06T5/002, G06F40/40, G06F40/284, G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-