-
公开(公告)号:US20240289926A1
公开(公告)日:2024-08-29
申请号:US18564915
申请日:2022-05-27
Applicant: Google LLC
Inventor: Carlos Riquelme Ruiz , André Susano Pinto , Basil Mustafa , Daniel M. Keysers , Joan Puigcerver i Perez , Maxim Neumann , Neil Matthew Tinmouth Houlsby , Rodolphe Jenatton
IPC: G06T5/60
CPC classification number: G06T5/60 , G06T2207/20084
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating predictions about images. One of the systems includes a neural network comprising a sequence of one or more network blocks that are each configured to perform operations comprising: obtaining a block input that represents an intermediate representation of an input image; determining a plurality of patches of the block input or of an updated representation of the block input, wherein each patch comprises a different subset of elements of the block input or of the updated representation of the block input; assigning each patch to one or more respective expert modules of a plurality of expert modules of the network block; for each patch of the plurality of patches, processing the patch using the corresponding expert modules to generate respective module outputs; and generating a block output by combining the module outputs.