-
公开(公告)号:US11238593B2
公开(公告)日:2022-02-01
申请号:US16789088
申请日:2020-02-12
Applicant: Adobe Inc.
Inventor: Kerem Can Turgutlu , Jayant Kumar , Jianming Zhang , Zhe Lin
Abstract: Techniques are disclosed for parsing a source image, to identify segments of one or more objects within the source image. The parsing is carried out by an image parsing pipeline that includes three distinct stages comprising three respectively neural network models. The source image can include one or more objects. A first neural network model of the pipeline identifies a section of the source image that includes the object comprising a plurality of segments. A second neural network model of the pipeline generates, from the section of the source image, a mask image, where the mask image identifies one or more segments of the object. A third neural network model of the pipeline further refines the identification of the segments in the mask image, to generate a parsed image. The parsed image identifies the segments of the object, by assigning corresponding unique labels to pixels of different segments of the object.
-
公开(公告)号:US20210248748A1
公开(公告)日:2021-08-12
申请号:US16789088
申请日:2020-02-12
Applicant: Adobe Inc.
Inventor: Kerem Can Turgutlu , Jayant Kumar , Jianming Zhang , Zhe Lin
Abstract: Techniques are disclosed for parsing a source image, to identify segments of one or more objects within the source image. The parsing is carried out by an image parsing pipeline that includes three distinct stages comprising three respectively neural network models. The source image can include one or more objects. A first neural network model of the pipeline identifies a section of the source image that includes the object comprising a plurality of segments. A second neural network model of the pipeline generates, from the section of the source image, a mask image, where the mask image identifys one or more segments of the object. A third neural network model of the pipeline further refines the identification of the segments in the mask image, to generate a parsed image. The parsed image identifies the segments of the object, by assigning corresponding unique labels to pixels of different segments of the object.
-
公开(公告)号:US20230274478A1
公开(公告)日:2023-08-31
申请号:US17652512
申请日:2022-02-25
Applicant: ADOBE INC.
Inventor: Kerem Can Turgutlu , Sanat Sharma , Jayant Kumar , Rohith Mohan Dodle , Vipul Dalal
IPC: G06T11/60 , G06V10/764 , G06V20/70 , G06V10/774 , G06V10/82
CPC classification number: G06T11/60 , G06V10/764 , G06V20/70 , G06V10/774 , G06V10/82 , G06T2210/12 , G06T2210/61
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive an image depicting an object; generate a sequence of tokens including a set of tokens corresponding to the object and a set of mask tokens corresponding to an additional object to be inserted into the image; generate a placement token value for the set of mask tokens based on the sequence of tokens using a sequence encoder, wherein the placement token value represents position information of the additional object; and insert the additional object into the image based on the position information to obtain a composite image.
-
-