Multi-scale Transformer for Image Analysis
    1.
    发明公开

    公开(公告)号:US20240119555A1

    公开(公告)日:2024-04-11

    申请号:US18527528

    申请日:2023-12-04

    申请人: Google LLC

    IPC分类号: G06T3/00 G06T3/40 G06T7/00

    摘要: The technology employs a patch-based multi-scale Transformer (300) that is usable with various imaging applications. This avoids constraints on image fixed input size and predicts the quality effectively on a native resolution image. A native resolution image (304) is transformed into a multi-scale representation (302), enabling the Transformer's self-attention mechanism to capture information on both fine-grained detailed patches and coarse-grained global patches. Spatial embedding (316) is employed to map patch positions to a fixed grid, in which patch locations at each scale are hashed to the same grid. A separate scale embedding (318) is employed to distinguish patches coming from different scales in the multiscale representation. Self-attention (508) is performed to create a final image representation. In some instances, prior to performing self-attention, the system may prepend a learnable classification token (322) to the set of input tokens.

    Preserving regions of interest in automatic image cropping

    公开(公告)号:US11663762B2

    公开(公告)日:2023-05-30

    申请号:US17083899

    申请日:2020-10-29

    申请人: Adobe Inc.

    IPC分类号: G06T11/60 G06T3/00 G06T7/12

    摘要: Embodiments of the present invention are directed to facilitating region of interest preservation. In accordance with some embodiments of the present invention, a region of interest preservation score using adaptive margins is determined. The region of interest preservation score indicates an extent to which at least one region of interest is preserved in a candidate image crop associated with an image. A region of interest positioning score is determined that indicates an extent to which a position of the at least one region of interest is preserved in the candidate image crop associated with the image. The region of interest preservation score and/or the preserving score are used to select a set of one or more candidate image crops as image crop suggestions.

    Distortion correction via modified analytical projection

    公开(公告)号:US11663704B2

    公开(公告)日:2023-05-30

    申请号:US17243405

    申请日:2021-04-28

    摘要: Examples are disclosed relating to applying an analytical geometric projection that has been modified by an amplitude function. One example provides a computing device comprising a logic subsystem and a storage subsystem holding instructions executable by the logic subsystem to receive an image of a scene as acquired by an image sensor, apply a mapping to the image of the scene that maps pixels of the image to projected pixels on an analytical projection that is modified by an amplitude function such that the analytical projection achieves a higher zoom effect on pixels closer to a center of the image compared to pixels closer to an edge of the image, thereby obtaining a corrected image, and output the corrected image.