UNSUPERVISED ZERO-SHOT SEGMENTATION MASK GENERATION AND SEMANTIC LABELING

    公开(公告)号:US20250045930A1

    公开(公告)日:2025-02-06

    申请号:US18792206

    申请日:2024-08-01

    Applicant: Google LLC

    Abstract: Implementations relate to generation of segmentation masks for images in a zero-shot, unsupervised manner. Implementations also relate to generation of labels for the segmentation layers of the segmentation mask. Implementations use self-attention maps from a pass of the image through a generative image model to determine the segmentation mask and may use cross-attention maps generated when a prompt describing the image is provided with the image to the generative image model. Implementations aggregate maps from different resolutions to determine the mask and labels. The disclosed techniques enable accurate segmentation for any image without apriori training, facilitating applications in image processing, computer vision, extended reality applications, and robotics.

Patent Agency Ranking