-
公开(公告)号:US20240257510A1
公开(公告)日:2024-08-01
申请号:US18289725
申请日:2021-08-06
Applicant: GOOGLE LLC
Inventor: Weicheng Kuo , Tsung-Yi Lin , Anelia Angelova , Dahun Kim
IPC: G06V10/82 , G06V10/44 , G06V10/774 , G06V10/776 , G06V20/00
CPC classification number: G06V10/82 , G06V10/44 , G06V10/774 , G06V10/776 , G06V20/00
Abstract: An object localization network (OLN) can be used to localize object(s) (e.g., known and/or unknown object(s)) in an instance of vision data. Various implementations include detecting the localized object(s) based on the localization. Many implementations include processing the instance of vision data using the OLN to generate a objectness score (e.g., a centerness score) as well as an intersection of union (IoU) score for one or more proposed object locations in the instance of vision data. Object(s) can be localized in the instance of vision data based on the objectness scores and the IoU scores.
-
公开(公告)号:US20240037926A1
公开(公告)日:2024-02-01
申请号:US18379532
申请日:2023-10-12
Applicant: Google LLC
Inventor: Weicheng Kuo , Anelia Angelova , Tsung-Yi Lin
IPC: G06V10/82 , G06V10/26 , G06V10/25 , G06V20/10 , G06V10/764 , G06V10/77 , G06V10/44 , G06T7/10 , G06V10/774
CPC classification number: G06V10/82 , G06V10/26 , G06V10/25 , G06V20/10 , G06V10/764 , G06V10/7715 , G06V10/454 , G06T7/10 , G06V10/774 , G06T2207/20081
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing instance segmentation by detecting and segmenting individual objects in an image. In one aspect, a method comprises: processing an image to generate data identifying a region of the image that depicts a particular object; obtaining data defining a plurality of example object segmentations; generating a respective weight value for each of the example object segmentations; for each of a plurality of pixels in the region of the image, determining a score characterizing a likelihood that the pixel is included in the particular object depicted in the region of the image using: (i) the example object segmentations, and (ii) the weight values for the example object segmentations; and generating a segmentation of the particular object depicted in the region of the image using the scores for the pixels in the region of the image.
-
公开(公告)号:US11823443B2
公开(公告)日:2023-11-21
申请号:US17290814
申请日:2019-08-14
Applicant: Google LLC
Inventor: Weicheng Kuo , Anelia Angelova , Tsung-Yi Lin
IPC: G06V10/82 , G06T7/10 , G06V10/26 , G06V10/25 , G06V20/10 , G06V10/764 , G06V10/77 , G06V10/44 , G06V10/774
CPC classification number: G06V10/82 , G06T7/10 , G06V10/25 , G06V10/26 , G06V10/454 , G06V10/764 , G06V10/774 , G06V10/7715 , G06V20/10 , G06T2207/20081
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing instance segmentation by detecting and segmenting individual objects in an image. In one aspect, a method comprises: processing an image to generate data identifying a region of the image that depicts a particular object; obtaining data defining a plurality of example object segmentations; generating a respective weight value for each of the example object segmentations; for each of a plurality of pixels in the region of the image, determining a score characterizing a likelihood that the pixel is included in the particular object depicted in the region of the image using: (i) the example object segmentations, and (ii) the weight values for the example object segmentations; and generating a segmentation of the particular object depicted in the region of the image using the scores for the pixels in the region of the image.
-
公开(公告)号:US20220301298A1
公开(公告)日:2022-09-22
申请号:US17697750
申请日:2022-03-17
Applicant: Google LLC
Inventor: Tsung-Yi Lin , Barret Zoph , Ekin Dogus Cubuk , Golnaz Ghiasi , Quoc V. Le
IPC: G06V10/82 , G06N3/08 , G06V10/774 , G06V10/77 , G06V10/776 , G06V10/764 , G06V10/80
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training an image representation neural network.
-
公开(公告)号:US11301733B2
公开(公告)日:2022-04-12
申请号:US16416848
申请日:2019-05-20
Applicant: Google LLC
Inventor: Jon Shlens , Ekin Dogus Cubuk , Quoc Le , Tsung-Yi Lin , Barret Zoph , Golnaz Ghiasi
Abstract: Example aspects of the present disclosure are directed to systems and methods for learning data augmentation strategies for improved object detection model performance. In particular, example aspects of the present disclosure are directed to iterative reinforcement learning approaches in which, at each of a plurality of iterations, a controller model selects a series of one or more augmentation operations to be applied to training images to generate augmented images. For example, the controller model can select the augmentation operations from a defined search space of available operations which can, for example, include operations that augment the training image without modification of the locations of a target object and corresponding bounding shape within the image and/or operations that do modify the locations of the target object and bounding shape within the training image.
-
公开(公告)号:US20220108204A1
公开(公告)日:2022-04-07
申请号:US17061355
申请日:2020-10-01
Applicant: Google LLC
Inventor: Xianzhi Du , Yin Cui , Tsung-Yi Lin , Quoc V. Le , Pengchong Jin , Mingxing Tan , Golnaz Ghiasi , Xiaodan Song
Abstract: A computer-implemented method of generating scale-permuted models can generate models having improved accuracy and reduced evaluation computational requirements. The method can include defining, by a computing system including one or more computing devices, a search space including a plurality of candidate permutations of a plurality of candidate feature blocks, each of the plurality of candidate feature blocks having a respective scale. The method can include performing, by the computing system, a plurality of search iterations by a search algorithm to select a scale-permuted model from the search space, the scale-permuted model based at least in part on a candidate permutation of the plurality of candidate permutations.
-
公开(公告)号:US20220092387A1
公开(公告)日:2022-03-24
申请号:US17433677
申请日:2020-02-25
Applicant: Google LLC
Inventor: Quoc V. Le , Golnaz Ghiasi , Tsung-Yi Lin
IPC: G06N3/04
Abstract: A computing system for producing an architecture of a pyramid layer is disclosed. The computing system can include a controller model configured to generate new architectures for a pyramid layer that receives a plurality of input feature representations output by a backbone model and, in response, outputs a plurality of output feature representations. The plurality of input feature representations can have a plurality of different input resolutions, and the plurality of output feature representations can have a plurality of different output resolutions. The computing system can be configured to perform a plurality of iterations. For each iteration, the computing system can receive a new pyramid layer architecture as an output of the controller model and evaluate one or more performance characteristics of a machine-learned pyramidal feature model that includes the backbone model and one or more pyramid layers that have the new pyramid layer architecture.
-
公开(公告)号:US20230230275A1
公开(公告)日:2023-07-20
申请号:US18011601
申请日:2021-11-15
Applicant: Google LLC
Inventor: Tsung-Yi Lin , Peter Raymond Florence , Yen-Chen Lin , Jonathan Tilton Barron
IPC: G06T7/70
CPC classification number: G06T7/70 , G06T2207/20081 , G06T2207/20084 , G06T2207/30244
Abstract: Provided are systems and methods that invert a trained NeRF model, which stores the structure of a scene or object, to estimate the 6D pose from an image taken with a novel view. 6D pose estimation has a wide range of applications, including visual localization and object pose estimation for robot manipulation.
-
公开(公告)号:US11682191B2
公开(公告)日:2023-06-20
申请号:US17702438
申请日:2022-03-23
Applicant: Google LLC
Inventor: Jon Shlens , Ekin Dogus Cubuk , Quoc Le , Tsung-Yi Lin , Barret Zoph , Golnaz Ghiasi
CPC classification number: G06V10/772 , G06F18/217 , G06F18/24 , G06T3/20 , G06T3/60 , G06T11/001
Abstract: Example aspects of the present disclosure are directed to systems and methods for learning data augmentation strategies for improved object detection model performance. In particular, example aspects of the present disclosure are directed to iterative reinforcement learning approaches in which, at each of a plurality of iterations, a controller model selects a series of one or more augmentation operations to be applied to training images to generate augmented images. For example, the controller model can select the augmentation operations from a defined search space of available operations which can, for example, include operations that augment the training image without modification of the locations of a target object and corresponding bounding shape within the image and/or operations that do modify the locations of the target object and bounding shape within the training image.
-
公开(公告)号:US20190354817A1
公开(公告)日:2019-11-21
申请号:US16416848
申请日:2019-05-20
Applicant: Google LLC
Inventor: Jon Shlens , Ekin Dogus Cubuk , Quoc Le , Tsung-Yi Lin , Barret Zoph , Golnaz Ghiasi
Abstract: Example aspects of the present disclosure are directed to systems and methods for learning data augmentation strategies for improved object detection model performance. In particular, example aspects of the present disclosure are directed to iterative reinforcement learning approaches in which, at each of a plurality of iterations, a controller model selects a series of one or more augmentation operations to be applied to training images to generate augmented images. For example, the controller model can select the augmentation operations from a defined search space of available operations which can, for example, include operations that augment the training image without modification of the locations of a target object and corresponding bounding shape within the image and/or operations that do modify the locations of the target object and bounding shape within the training image.
-
-
-
-
-
-
-
-
-