-
公开(公告)号:US20250139959A1
公开(公告)日:2025-05-01
申请号:US18690550
申请日:2022-09-19
Applicant: Google LLC
Inventor: Ting Chen , Saurabh Saxena , Yi Li , Geoffrey E. Hinton , David James Fleet
IPC: G06V10/82 , G06V10/764 , G06V10/774 , G06V10/776 , G06V20/70
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for object detection using neural networks. In one aspect, one of the methods includes obtaining an input image; processing the input image using an object detection neural network to generate an output sequence that comprises respective token at each of a plurality of time steps, wherein each token is selected from a vocabulary of tokens that comprises (i) a first set of tokens that each represent a respective discrete number from a set of discretized numbers and (ii) a second set of tokens that each represent a respective object category from a set of object categories; and generating, from the tokens in the output sequence, an object detection output for the input image.
-
公开(公告)号:US20240249456A1
公开(公告)日:2024-07-25
申请号:US18624960
申请日:2024-04-02
Applicant: Google LLC
Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho
IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70
CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-
公开(公告)号:US11574339B1
公开(公告)日:2023-02-07
申请号:US16705919
申请日:2019-12-06
Applicant: Google LLC
Inventor: Yew Jin Lim , James Kunz , Randolph Gregory Brown , Beidou Wang , David Hou , Kyle Zaragoza , Yi Li , Nikita Kirnosov , Tao Feng
Abstract: Processor(s) of a client device can: analyze one or more features of an electronic resource that is under consideration for solicitation to a user; determine a notification likelihood that the user will access the electronic resource in response to an unsolicited notification of the electronic resource being output to the user; determine a baseline likelihood that the user will access the electronic resource without being solicited; compare the notification likelihood with the baseline likelihood; and cause, based on the comparing, the unsolicited notification to be output to the user. In some implementations, determining the notification likelihood and/or the baseline likelihood is based on applying data associated with the electronic resource as input across a machine learning model to generate output indicative of the notification likelihood and/or the baseline likelihood. In other implementations, determining the notification likelihood and/or the baseline likelihood is based on past behavior or preference(s) of the user.
-
公开(公告)号:US11978141B2
公开(公告)日:2024-05-07
申请号:US18199883
申请日:2023-05-19
Applicant: Google LLC
Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho
IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/40 , G06T3/4053 , G06T5/00
CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/002
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-
公开(公告)号:US20230377226A1
公开(公告)日:2023-11-23
申请号:US18199883
申请日:2023-05-19
Applicant: Google LLC
Inventor: Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho
CPC classification number: G06T11/60 , G06T3/4053 , G06T5/002 , G06F40/40 , G06F40/284 , G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
-
-
-
-